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(57) Abstract: The present invention discloses isolated nucleic acid molecules encoding a hyperimmune serum reactive antigen or 
a fragment thereof as well as hyperimmune serum reactive antigens or fragments thereof from S. agalactiae, methods for isolating 
such antigens and specific uses thereof. 



WO 2004/099242 



PCT/EP2004/004856 



S. agalactia^ .antigens I + II 

The present invention relates to isolated nucleic acid molecules, which encode antigens for Streptococcus 
agalactiae, which are suitable for use in preparation of pharmaceutical medicaments for the prevention 
and treatment of bacterial infections caused by Streptococcus agalactiae. 

Streptococcus agalactiae is a gram-positive bacterium, and belongs to the Group B Streptococci (GBS) based 
on its hemolysis pattern in blood agar. The organism is encapsulated, and capsule is an important 
element of pathogenicity. Capsules are antigenic and form the basis for classifying GBS by serotypes. 
Nine distinct GBS serotypes have been identified: la, lb, II, III, IV, V, VI, VII and VIIL Most S. agalactiae 
serotypes have been shown to cause serious disease, and the two most common serotypes - type III and V 
- are estimated to account for the majority of invasive disease worldwide. The ranking and serotype 
prevalence differs by age group and geographic area. In the US, GBS type III causes more than 50% of 
infant disease, type V about 40% of nonpregnant adult disease, and type la about a third of disease in any 
patient population. 

Streptococcus agalactiae is an important agent of human disease at the extremities of age and in those who 
have underlying disease. Group B Streptococci are the major cause of generalized and focal infections in 
the newborn infant. GBS is the predominant pathogen in newborns in the US since the 1970's. Bacterial 
infection can lead to life threatening diseases, such as sepsis, pneumonia and meningitis. Survivers can 
become permanently handicapped with hearing, learning and visual disabilities. Newborns usually 
acquire the organism intrapartum or during delivery from their GBS-colonized mothers. In addition, GBS 
are also a frequent cause of infections in pregnant women and in chronically ill and elderly patients, such 
as those suffering from diabetes, malignancies, immunodeficiencies, etc., (reviewed by Baiter, S. et al. in 
Gram positive pathogens ed. by Fischetti V.A. et al. ASM Press 2000, pp 154-160). 

10-35% of pregnant women are colonized with GBS, but are asymptomatic. However, GBS colonization is 
important because of the risk of vertical transmission. 50-70% of neonates born to colonized women - that 
is 5-15% of all newborns - become colonized by GBS during delivery. Colonization is a prerequisite for 
infection and disease. The most preterm infants are at the highest risk for invasive disease due to low 
maternal transfer of antibodies and immature immune system. GBS carriage during pregnancy may be 
chronic, intermittent, or transient. It is difficult to assess the duration of carriage, since women are 
screened once during a prenatal visit from the late trimester. Several studies^ suggest that there is a 
correlation (-90%) between the colonization status in the third trimester and at the time of delivery. 
Serotyping of the isolates indicates that persistence of the same type is most common (reviewed by Baiter, 
S. et al. in Gram positive pathogens ed. by Fischetti V.A. et al. ASM Press 2000, pp 154-160). 

Without preventive intervention, 1 to 2% of all neonates and 15% of neonates born to heavily colonized 
women develop invasive disease (sepsis, pneumonia and/or meningitis). In the US, GBS infections affect 
1-5 newborns/1000 live birth. About 17.000 cases of invasive GBS disease occurred in the US annually, 
and 7.500 occurred in newborns before prevention. GBS is the most common cause of bacterial meningitis 
accounting for ~ 40 % of all cases reported in this age group. The overall incidence of invasive GBS 
disease is 0.2-0.7/100.000 in the US. It is somewhat lower in Europe. Mortality without preventive 
intervention is 6% with invasive disease, every 16. infected newborn dies and 20% of survivers become 
permanently handicapped. 

The rates of serious group B strep infections are much higher among newborns than among any other age 
group. Nonetheless, serious group B strep infections occur in other age groups in both men and women. 
Among non-pregnant adults, rates of serious disease range from 4.1 to 7.2 cases per 100,000 populations. 
The average death rate for invasive infections (infections where the bacteria have entered a part of the 
body that is normally not exposed to bacteria) is 8-10% for adult's ages 18-64 and 15-25% for adults 65 
years of age and over. Mortality rates are lower among younger adults, and adults who do not have other 
medical conditions. The rate of serious group B strep disease increases with age. The average age of cases 
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in non-pregnant adults is about 60 years old. Most adult group B strep disease occurs in adults who have 
serious medical conditions. These include: diabetes mellitus; liver disease; history of stroke; history of 
cancer; or bedsores. Among the elderly, rates of serious group B strep disease are more common among 
residents of nursing facilities, and among bedridden hospitalized patients. Group B strep disease among 
non-pregnant adults may often be acquired after recent trauma, or after having certain invasive hospital 
procedures like surgery ({Farley, M., 2001}; {Jackson, L. et al., 1995}; www.cdc.gov/groupbstrep/). 

Direct medical costs of neonatal disease before prevention were $294 million annually and GBS continues 
to pose a considerable economic burden. 

A definitive diagnosis of infection with Streptococcus agalactiae generally relies on isolation of the 
organism from cervical swabs, blood or other normally sterile body sites. Tests are also available to detect 
capsular polysaccharide antigen in body fluids. 

Penicillin G is the treatment of choice for established cases of GBS. Ten days of treatment is 
recommended for bacteremia, pneumonia and soft tissue infections, while 2-3 weeks is recommended for 
meningitis and 3-4 weeks for osteomyelitis. 

Prevention has been established since 1994 in North America by screening pregnant women for carriage 
of GBS, taking vaginal and anorectal swabs at 35-37 weeks' gestation, or by identifying risk factors at 
admission for delivery without cultures. Women who are candidates for prophylaxis are given 
intrapartum antibiotic therapy during labor to prevent early-onset neonatal disease. This prevention 
method has decreased the incidence of GBS disease from 1.7 to 0.4/1000 live births between 1993 and 1999 
in the US. Although most neonatal GBS disease can be prevented through intrapartum prophylaxis 
(Penicillin G or Ampicillin), currently available strategies are not ideal, especially for the prevention of 
late-onset (>7 days of age) infections and disease in premature babies. There are always individuals who 
escape of screening for carriage due to several reasons, such as intermittent carriers, who are tested 
negative at wks 32-35, but become positive during delivery, unattendance, negligence, or delivery before 
screening date (32-35 wks). 

In the long run, widespread use of antibiotics usually induces resistant strains that appear after a period 
of time. Extensive use of Penicillin (every 3-5 th women are treated with high dose), and other antibiotics 
has already been shown to steadily increase the percentage of antibotic resistant clinical isolates (ref). 
Moreover, efficiency of antibiotic based prevention is not that effective for late onset disease, as it is for 
early onset (within 48 hrs after delivery). An additional concern is that prevention in susceptible adult 
populations has not been addressed. 

Vaccine development is hindered by the lack of sufficient knowledge about the elements of protective 
immunity against GBS carriage and disease. The relationship of carriage to the development of natural 
immunity is poorly understood. In addition, the immunologic mechanism that allows disease to occur in 
a carrier is ill defined. However, it is suggested that the maternal serum levels of pathogen-specific 
antibodies are correlated with neonatal GBS disease. It has been firmly established that there is an inverse 
correlation between maternal anti-capsular polysaccharide antibody levels at delivery and the frequency 
of invasive neonatal diseases {Campbell, J. et al., 2000}. 

Although the group B carbohydrate antigen is common to all strains of GBS, unfortunately, it is not 
strongly immunogenic and antibodies are not protective from lethal challenge in experimental models. 
The GBS capsule itself that is made of polysaccharides, is immunogenic and is able to induce protective 
antibodies. However, this protection is type-specific. Although capsular specific antibodies have been 
shown to be highly protective, it remains unclear what concentration of these serotype-specific antibodies 
protect against disease and more recently it has become clear that opsonic activity and avidity of these 
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antibodies are more critical determinants of protection than concentration. 

The importance of surface proteins in human immunity to S. agalactiae already has been appreciated. It is 
apparent that all serotypes express surface proteins with activity relevant to host immune defense. The 
alpha C protein, beta C protein, Rib and Sip proteins are well-characterized biochemically and 
genetically, and have also been shown to immunogenic and protective in animal models ({Michel, J. et al., 
1991}; {Brodeur, B. et aL, 2000}; {Larsson, C. et al., 1999}; {Cheng, Q. et al., 2002}). The major problem with 
these proteins as vaccine candidates seems to be their variability in prevalenve among the different 
clinical isolates of GBS. The Rib protein for example is present in serotype III GBS, but missing from type 
V, which responsible for significant portion of disease worldwide. Some other surface proteins are 
characterized as being immunogenic, but there is a limited systematic work done to identify most of the 
immunogenic proteins of GBS. 

Thus, there remains a need for an effective treatment to prevent or ameliorate GBS infections. A vaccine 
could not only prevent infections by GBS, but more specifically prevent or ameliorate colonization of host 
tissues (esp. in the birth canal), thereby reducing the incidence of transmission from mother to fetus. 
Reducing the incidence of acute infection and carriage of the organism would lead to prevention of 
invasive diseases in newborns - pneumonia, bacteremia, meningitis, and sepsis. Vaccines capable of 
showing cross-protection against the majority of S. agalactiae strains causing human infections could also 
be useful to prevent or ameliorate infections caused by all other streptococcal species, namely groups A, 
C and G. 

A vaccine can contain a whole variety of different antigens. Examples of antigens are whole-killed or 
attenuated organisms, subfractions of these organisms/tissues, proteins, or, in their most simple form, 
peptides. Antigens can also be recognized by the immune system in form of glycosylated proteins or 
peptides and may also be or contain polysaccharides or lipids. Short peptides can be used since for 
example cytotoxic T-cells (CTL) recognize antigens in form of short usually 8-11 amino acids long 
peptides in conjunction with major histocompatibility complex (MHC). B-cells can recognize linear 
epitopes as short as 4-5 amino acids, as well as three-dimensional structures (conformational epitopes). In 
order to obtain sustained, antigen-specific immune responses, adjuvants need to trigger immune 
cascades that involve all cells of the immune system. Primarily, adjuvants are acting, but are not 
restricted in their mode of action, on so-called antigen presenting cells (APCs). These cells usually first 
encounter the antigen(s) followed by presentation of processed or unmodified antigen to immune effector 
cells. Intermediate cell types may also be involved. Only effector cells with the appropriate specificity are 
activated in a productive immune response. The adjuvant may also locally retain antigens and co-injected 
other factors. In addition the adjuvant may act as a chemoattractant for other immune cells or may act 
locally and/or systemically as a stimulating agent for the immune system. 

Vaccine development since the late 1970s has focused on the capsular polysaccharides, but a safe, 
effective product is still not available. However, vaccine against S. agalactiae is ranked as one of the most 
important for development and administration to infants and high-risk adults. Currently vaccines against 
this infection are only in the research stages of development. Efforts are focused on using capsular 
polysaccharide (CPS) as immunogens, either with or without conjugation to protein {Paoletti, L. et al., 
2002}. However, there are several arguments against the use of polysaccharide-based vaccine. 
Polysaccharides induce IgG2 antibodies, which cross the placenta less efficiently then IgGl or IgG3 
antibodies. It is especially a problem for the most susceptible neonates, the still-borns since placental 
antibody transfer is low before weeks 32-34. It is estimated that -10% of deliveries occur before the 34 th 
pregnancy week. 

Protein conjugate vaccines are no doubt a great new addition to the amarmatorium in the battle against 
GBS disease, but the vaccine can contain only a limited number of GBS serotypes and given adequate 
ecological pressure, replacement disease by non-vaccine serotypes remains a real threat, particularly in 
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areas with very high disease burden. Morover polysaccharide antigens used for active immunization do 
not provide immunological memory in humans. Conjugation of CPS to non-GBS related immunogenic 
protein carriers (e.g. tetanus toxoid, cholera toxin B subunit, etc.) has been shown to beneficial in 
inducing higher concentrations of antibodies in vaccinees, but it does not provide pathogen-specific B cell 
and T cell epitopes which would recruit memory B and T cells during a real infection to support the most 
effective host response. To be able to supplement the CPS vaccines with proteins fulfilling these criteria it 
is necessary to identify conserved immunogenic GBS-specific surface proteins. 

All these insufficiencies suggest that there is a need to develop new generation vaccines composed of 
proteins, or their derivatives, expressed by all strains under in vivo conditions with the ability to induce 
opsonizing and/or neutralizing antibodies in humans. 

There is a great potential for passive antibody-based therapy. There have been already attempts to use 
human intravenous immunoglobulin (IVIG) preparations for prevention. Recent advances in the 
technology of monoclonal antibody production provide the means to generate human antibody reagents 
and reintroduce antibody therapies, while avoiding the toxicities associated with serum therapy. 
Immunoglobulins are an extremely versatile class of antimicrobial proteins that can be used to prevent 
and treat emerging infectious diseases. Antibody therapy has been effective against a variety of diverse 
microorganisms (reviewed in {Burnie, J. et al., 1998}). Anti-GBS mAbs could be given therapeutically to 
every newborn that develop invasive diseases or preventively to low birth-weight and premature 
neonates. 

During the last decade the immunogenicity and protective capacity of several GBS proteins have been 
described in animal models and these are now being explored for the development of species-common 
protein based vaccines. Such proteins are the GBS surface proteins Sip {Brodeur, B. et al., 2000}, rib, <$>- 
protein and {Michel, J. et al., 1991}. 

Certain proteins or enzymes displayed on the surface of gram-positive organisms significantly contribute 
to pathogenesis, are involved in the disease process caused by these pathogens. Often, these proteins are 
involved in direct interactions with host tissues or in conceiling the bacterial surface from the host 
defense mechanisms {Navarre, W. et al., 1999}. S. agalactiae is not an exception in this regard. Several 
surface proteins are characterized as virulence factors, important for GBS pathogenicity ((reviewed in 
(Paoletti L.C. et al. in Gram positive pathogens, ed. by Fischetti V.A et al., ASM Press 2000, pp 137-153); 
{Paoletti, L. et al v 2002}). If antibodies to these proteins could offer better protection to humans then 
polysaccharides, they could provide the source of a novel, protein-based GBS vaccine to be used in 
conjunction with or in place of the more traditional capsular polysaccharise vaccine. The use of some of 
the above-described proteins as antigens for a potential vaccine as well as a number of additional 
candidates resulted mainly from a selection based on easiness of identification or chance of availability. 
There is a demand to identify relevant antigens for S. agalactiae in a more comprehensive way. 

The present inventors have developed a method for identification, isolation and production of 
hyperimmune serum reactive antigens from a specific pathogen, especially from Staphylococcus aureus 
and Staphylococcus epidermidis (WO 02/059148). However, given the differences in biological property, 
pathogenic function and genetic background, Streptococcus agalactiae is distinctive from Staphylococcus 
strains. Importantly, the selection of sera for the identification of antigens from S. agalactiae is different 
from that applied to the S. aureus screens. Four major types of human antibody sources were collected for 
that purpose. First, healthy pregnant women who were tested negative for cervical and anorectal carriage 
of GBS. This donor group represents the most important source of antibodies. In addition to their serum 
samples, human cervical secretions collected with cervical wicks, containing secretory IgA (slgA) were 
also used for antigen idntification and validation. The main value of this collection is that slgA can be 
considered the major immune effector molecule on mucosal surfaces. Second, healthy pregnant women 
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colonized with GBS whos newborn remained GBS-free (although with antibiotic prevention). Third, 
adults below <45 years of age without clinical disease. Four, naive individuals, youg children between 5 
and 10 months of age, after they already lost maternal antibodies and have not acquired GBS-specific 
ones due to the lack of GBS disease. 

To be able to select for relevant serum sources, a series of ELISAs and immunoblotting experiments 
measuring anti-S. agalactiae IgG and IgA antibody levels were performed with bacterial lysates and 
culture supernatant proteins. Sera from high titer carriers and non-carriers were included in the genomic- 
based antigen identification. This approach for selection of human sera is basically very different from 
that used for S. aureus, where carriage or non-carriage state couldn't be associated with antibody levels. 

The present invention uses high throughput genomic method to identify in vivo expressed pathogen- 
specific proteins with the ability to induce antibodies in humans during the course of infections and 
colonization. 

The genomes of the two bacterial species S. agalactiae and S. aureus by itself show a number of important 
differences. The genome of S. agalactiae contains app. 2.2 Mb, while S. aureus harbours 2.85 Mb. They have 
an average GC content of 35.7 and 33%, respectively and approximately 30 to 45% of the encoded genes 
are not shared between the two pathogens. In addition, the two bacterial species require different growth 
conditions and media for propagation. A list of the most important diseases, which can be inflicted by the 
two pathogens is presented below. S. aureus causes mainly nosocomial, opportunistic infections: 
impetigo, folliculitis, abscesses, boils, infected lacerations, endocarditis, meningitis, septic arthritis, 
pneumonia, osteomyelitis, scalded skin syndrome (SSS), toxic shock syndrome. S. agalactiae causes mainly 
neonatal infections and diseases in elderly, such as bacteremia, sepsis, wound infection, osteomyelitis and 
meningitis. 

The complete genome sequence of a capsular serotype III isolate of S. agalactiae, designated NEM316 
(ATCC 12403) was determined by the random shotgun sequencing strategy (GenBank accession number 
AL732656; see www.tigr.org/tigrscripts/CMR2/CMRHomePage.spl). {Glaser, P. et al., 2002}. 

Hie problem underlying the present invention was to provide means for the development of 
medicaments such as vaccines against S. agalactiae infection. More particularly, the problem was to 
provide an efficient, relevant and comprehensive set of nucleic acid molecules or hyperimmune serum 
reactive antigens from S. agalactiae that can be used for the manufacture of said medicaments. 

Therefore, the present invention provides an isolated nucleic acid molecule encoding a hyperimmune 
serum reactive antigen or a fragment thereof comprising a nucleic acid sequence, which is selected from 
the group consisting of: 

a) a nucleic acid molecule having at least 70% sequence identity to a nucleic acid molecule selected 
from Seq ID No 14, 90, 157-216. 

b) a nucleic acid molecule which is complementary to the nucleic acid molecule of a), 

c) a nucleic acid molecule comprising at least 15 sequential bases of the nucleic acid molecule of a) 
or b) 

d) a nucleic acid molecule which anneals under stringent hybridisation conditions to the nucleic 
acid molecule of a), b), or c) 

e) a nucleic acid molecule which, but for the degeneracy of the genetic code, would hybridise to the 
nucleic acid molecule defined in a), b), c) or d). 

According to a preferred embodiment of the present invention the sequence identity is at least 80%, 
preferably at least 95%, especially 100%. 

Furthermore, the present invention provides an isolated nucleic acid molecule encoding a hyperimmune 
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serum reactive antigen or a fragment thereof comprising a nucleic acid sequence selected from the group 
consisting of 

a) a nucleic acid molecule having 96 % or more than 96 %, preferably at least 98 %, especially 100 % 
sequence identity to a nucleic acid molecule selected from Seq ID No 1, 3, 5-13, 15, 18-25, 27-31, 
33-36, 39-68, 70-85, 92-100, 103-126, 128-145, 147, 149-156, 217, 435-448 and 463-474. 

b) a nucleic acid molecule which is complementary to the nucleic acid molecule of a), 

c) a nucleic acid molecule comprising at least 15 sequential bases of the nucleic acid molecule of a) 
orb) 

d) a nucleic acid molecule which anneals under stringent hybridisation conditions to the nucleic 
acid molecule of a), b) or c), 

e) a nucleic acid molecule which, but for the degeneracy of the genetic code, would hybridise to the 
nucleic acid defined in a), b), c) or d). 

According to another aspect, the present invention provides an isolated nucleic acid molecule encoding a 
hyperimmune serum reactive antigen or a fragment thereof comprising a nucleic acid sequence selected 
from the group consisting of 

a) a nucleic acid molecule having 98 % or more than 98%, escpecially 100 % sequence identity to a 
nucleic acid molecule selected from Seq ID No 32, 86, 91, 101, 127. 

b) a nucleic acid molecule which is complementary to the nucleic acid of a), 

c) a nucleic acid molecule which, but for the degeneracy of the genetic code, would hybridise to the 
nucleic acid defined in a), b), c) or d). 

Preferably, the nucleic acid molecule is DNA or RNA. 

According to a preferred embodiment of the present invention, the nucleic acid molecule is isolated from 
a genomic DNA, especially from a S. agalactiae genomic DNA. 

According to the present invention a vector comprising a nucleic acid molecule according to any of the 
present invention is provided. 

In a preferred embodiment the vector is adapted for recombinant expression of the hyperimmune serum 
reactive antigens or fragments thereof encoded by the nucleic acid molecule according to the present 
invention. 

The present invention also provides a host cell comprising the vector according to the present invention. 

According to another aspect the present invention further provides a hyperimmune serum-reactive 
antigen comprising an amino acid sequence being encoded by a nucleic acid molecule according to the 
present invention. 

In a preferred embodiment the amino acid sequence (polypeptide) is selected from the group consisting 
of Seq ID No 231, 307, 374-433. 

In another preferred embodiment the amino acid sequence (polypeptide) is selected from the group 
consisting of Seq ID No 218, 220, 222-230, 232, 235-242, 244-248, 250-253, 256-285, 287-302, 309-317, 320- 
343, 345-362, 364, 366-373, 434, 449-462 and 475-486. 

In a further preferred embodiment the amino acid sequence (polypeptide) is selected from the group 
consisting of Seq ID No 249, 303, 308, 318, 344. 

According to a further aspect the present invention provides fragments of hyperimmune serum-reactive 
antigens selected from the group consisting of peptides comprising amino acid sequences of column 
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"predicted immunogenic aa" and "location of identified immunogenic region" of Table 1A, especially 
peptides comprising amino acid 4-20, 35-44, 65-70, 73-87, 92-98, 112-137, 152-161, 177-186, 193-200, 206- 
213 229-255, 282-294, 308-313, 320-326, 349-355, 373-384, 388-406, 420-425 and 115-199 of Seq ID No 218; 
5-24, 35-41, 44-70, 73-89, 103-109, 127-143, 155-161, 185-190, 192-207, 212-219, 246-262, 304-336, 372-382, 
384-393 398-407, 412-418, 438-444, 1-75, 76-161 and 164-239 of Seq ID No 219; 4-10, 16-58, 60-71, 77-92, 
100-126, 132-146, 149-164, 166-172, 190-209, 214-220, 223-229, 241-256, 297-312, 314-319, 337-343, 351-359, 
378-387, 398-418, 421-428, 430-437, 440-448, 462-471, 510-519, 525-536, 552-559, 561-568, 573-582, 596-602, 
608-630, 637-649, 651-665, 681-702, 714-732, 739-745, 757-778, 790-805, 807-815, 821-829, 836-842, 846-873, 
880-903, 908-914, 916-923, 931-940, 943-948, 956-970, 975-986, 996-1015, 1031-1040, 1051-1069, 1072-1095, 
1114-1119 1130-1148, 1150-1157, 1169-1176, 1229-1238 and 802-812 of Seq ID No 220; 5-12, 14-26, 35-47, 
52-67, 72-78, 83-98, 121-141, 152-159, 163-183, 186-207, 209-257, 264-277, 282-299, 301-309, 312-318, 324-339, 
358-368, 372-378, 387-397, 425-431 and 46-291 of Seq ID No 221; 29-38, 44-64, 70-76, 78-87, 94-100, 102-112, 
119-134, 140-149, 163-173, 178-186, 188-194, 207-234, 247-262, 269-290 and 73-92 of Seq ID No 222; 10-28, 
36-63, 77-87, 103-119, 127-136, 141-169, 171-183, 195-200, 207-232, 236-246, 251-265, 268-283, 287-297, 314- 
322 335-343, 354-363, 384-390, 405-411, 419-436, 443-455, 467-473, 480-513, 518-529, 550-557, 565-585, 602- 
608 616-625, 632-660, 665-677, 685-701, 726-736, 738-747, 752-761, 785-796, 801-813, 838-853, 866-871 and 
757-774 of Seq ID No 223; 31-38, 61-66, 74-81, 90-115, 123-145, 154-167, 169-179, 182-193, 200-206, 238-244, 
267-272 and 235-251 of Seq ID No 224; 19-25, 38-54, 56-64, 66-72, 74-92, 94-100, 116-129, 143-149, 156-183, 
204-232, 253-266, 269-275, 294-307 and 241-313 of Seq ID No 225; 5-34, 50-56, 60-65, 74-85, 89-97, 108-119, 
159-165, 181-199, 209-225, 230-240, 245-251, 257-262, 274-282, 300-305 and 64-75 of Seq ID No 226; 5-13, 
16-21, 27-42, 45-52, 58-66, 74-87, 108-114, 119-131 and 39-51 of Seq ID No 227; 6-23, 46-54, 59-65, 78-84, 
100-120, 128-133, 140-146, 159-165, 171-183, 190-204, 224-232, 240-248, 250-259, 274-280, 288-296, 306-315 
and 267-274 of Seq ID No 228; 5-12, 15-24, 26-36, 42-65, 68-80, 82-104, 111-116, 125-144, 159-167, 184-189, 
209-218, 235-243, 254-265, 269-283, 287-300, 306-316, 318-336, 338-352, 374-392 and 162-174 of Seq ID No 
229; 30-42, 45-54 and 25-37 of Seq ID No 230; 10-30, 53-59, 86-95, 116-130, 132-147, 169-189, 195-201, 212- 
221, 247-256, 258-265, 278-283, 291-298, 310-316, 329-339, 341-352, 360-367, 388-396, 398-411, 416-432, 443- 
452^ 460-466, 506-512, 515-521, 542-548 and 419-431 of Seq ID No 231; 4-27, 30-53, 60-67, 70-90, 92-151, 
159-185, 189-195, 198-210, 215-239 and 173-189 of Seq ID No 232; 4-26, 41-54, 71-78, 116-127, 140-149, 151- 
158, 161-175, 190-196, 201-208, 220-226, 240-252, 266-281, 298-305, 308-318, 321-329, 344-353, 372-378, 384- 
405' 418-426, 429-442, 457-463, 494-505, 514-522 and 174-188 of Seq ID No 233; 17-25, 27-39, 61-67, 81-89, 
99-110, 120-131, 133-139, 147-161, 167-172, 179-185, 192-198, 203-213, 226-238, 243-258, 261-267, 284-290, 
296-307 311-328, 340-352, 356-371 and 239-256 of Seq ID No 234; 8-30, 40-49, 67-80, 114-123, 126-142, 152- 
162, 188-194 and 57-70 of Seq ID No 235; 4-23, 28-34, 36-47, 50-61, 76-81, 89-94, 96-104, 112-119, 126-146, 
155-181, 195-200, 208-214, 220-229, 244-260, 263-276, 282-288, 292-300, 317-323, 336-351, 353-359, 363-375, 
382-399, 415-432, 444-455, 458-471, 476-481, 484-492, 499-517, 522-529, 535-541, 543-568, 572-584, 586-600, 
607-617, 626-637, 656-675 and 282-297 of Seq ID No 236; 6-24, 30-35, 38-45, 63-91, 134-140, 146-160, 167- 
188, 214-220, 226-234, 244-250, 260-270, 286-301, 316-329, 340-371, 429-446, 448-459, 474-481, 485-491, 512- 
526 537-544, 550-565, 573-583, 596-613, 621-630, 652-658 and 87-97 of Seq ID No 237; 8-20, 26-48, 56-67, 
76-86, 94-109, 115-121, 123-129, 143-160, 178-186, 191-198, 201-208, 221-236, 238-244, 260-268 and 237-247 
of Seq ID No 238; 4-40, 42-57, 73-87, 98-117, 126-135, 150-156, 166-174, 196-217, 231-236, 248-258, 276-284, 
293-301 307-313, 339-347, 359-365, 375-387, 395-402, 428-440, 445-456, 485-490, 497-505, 535-541, 547-555, 
610-625, 648-656, 665-671 and 448-528 of Seq ID No 239; 10-18, 39-45, 51-61, 80-96, 98-106, 110-115, 158- 
172 174-183, 191-200, 220-237, 249-255, 274-289, 308-324, 331-341, 372-381, 384-397, 405-414 and 322-338 of 
Seq ID No 240; 30-36, 38-56, 85-108, 134-147, 149-160, 163-183, 188-201, 206-211, 219-238, 247-254 and 5-13 
of Seq ID No 241; 11-40, 98-103, 110-115, 133-145, 151-159, 172-179, 192-201, 204-212, 222-228, 235-245, 
258-268, 283-296, 298-309, 322-329, 342-351, 354-362, 372-378, 385-393, 407-418, 495-516 and 1-148 of Seq 
ID No 242; 5-19, 21-36, 73-94, 112-119, 122-137, 139-145, 152-167, 184-190, 198-204, 208-224, 249-265, 267- 
281, 299-304, 309-317, 326-333, 356-364, 368-374, 381-389, 391-414, 419-425, 430-435 and 113-140 of Seq ID 
No 243; 45-54, 59-67, 78-91 and 15-23 of Seq ID No 244; 11-22, 33-47, 52-80, 88-112, 124-129 and 6-25 of 
Seq ID No 245; 26-41, 51-63, 80-89, 93-115, 150-163, 187-193, 220-237, 240-249, 286-294, 296-306, 316-329, 
345-353, 361-370, 407-425, 428-437, 474-482, 484-494, 504-517, 533-541, 549-558, 595-613, 616-625, 660-668, 
673-685, 711-726, 736-744, 749-761, 787-802, 812-820, 825-837, 863-878, 888-896, 901-913, 939-954, 964-972, 
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977-989, 1003-1008, 1016-1022, 1028-1034, 1041-1053, 1059-1074, 1101-1122, 420-511 and 581-704 of Seq ID 
No 246-18-25, 27-55, 71-83, 89-95, 102-113, 120-146, 150-156, 174-185 and 159-175 of Seq ID No 247; 24-30, 
38-56, 63-68, 87-93, 136-142, 153-164, 183-199, 213-219, 226-234, 244-261, 269-278, 283-289, 291-297, 320-328, 
330-336, 340-346, 348-356, 358-366, 382-387, 401-408, 414-419, 449-455, 468-491, 504-512, 531-537, 554-560, 
597-608 621-627, 632-643, 650-662, 667-692, 703-716, 724-737, 743-758, 783-794, 800-818, 846-856 and 806- 
884 of Seq ID No 248; 4-14, 21-39, 86-92, 99-107, 121-131, 136-144, 147-154, 158-166, 176-185, 193-199, 207- 
222 224-230 and 117-136 of Seq ID No 249; 65-76, 85-97, 103-109, 115-121, 125-146, 163-169, 196-205, 212- 
219 228-237, 241-247, 254-262, 269-288, 294-303, 305-313, 328-367, 395-401, 405-412, 418-429, 437-447, 481- 
488 506-513, 519-524, 530-541, 546-557 and 266-284 of Seq ID No 250; 5-14, 37-42, 49-71, 78-92, 97-112, 
127-136 147-154, 156-163, 186-198, 216-225, 233-243, 248-253, 295-307, 323-332, 359-366, 368-374, 380-398 
and 194-223 of Seq ID No 251; 4-11, 33-39, 45-72, 100-113, 119-129, 136-144, 169-175, 177-185, 200-208, 210- 
219, 262-276, 278-297, 320-326, 336-344, 347-362, 381-394, 443-453 and 438-454 of Seq ID No 252; 4-29, 31- 
52 55-61, 95-110, 138-158, 162-171, 179-187, 202-229, 239-248, 251-256, 262-267, 269-285, 304-310, 351-360, 
362-368, 381-388, 415-428, 435-440, 448-458 and 161-178 of Seq ID No 253; 4-17, 19-28, 32-43, 47-59, 89-110, 
112-126, 128-134, 140-148, 152-161, 169-184, 191-204, 230-235, 255-264, 328-338, 341-347, 401-409, 413-419, 
433-44l' 449-458, 463-468, 476-482, 486-492, 500-506, 529-545 and 305-381 of Seq ID No 254; 10-29, 38-45, 
53-61, 134-145, 152-160, 163-170, 202-208, 219-229, 248-258, 266-275, 282-288, 315-320, 328-334, 377-385, 
392-402 418-424, 447-453, 460-471, 479-487, 491-497, 500-507, 531-537, 581-594, 615-623, 629-635, 644-652, 
659-666 668-678, 710-717, 719-728, 736-741, 747-760, 766-773, 784-789, 794-800, 805-817, 855-861, 866-887 
and 698-715 of Seq ID No 255; 16-26, 29-37, 44-58, 62-68, 74-80, 88-95, 97-120, 125-144, 165-196 and 58-72 
of Seq ID No 256; 14-21, 23-46, 49-60, 63-74, 78-92, 96-103, 117-129, 134-161, 169-211, 217-231, 239-248, 252- 
281, 292-299, 313-343 and 243-257 of Seq ID No 257; 11-27, 46-52, 67-72, 76-84, 91-112, 116-153, 160-175, 
187-196 202-211, 213-220 and 43-76 of Seq ID No 258; 5-29, 37-56, 78-86, 108-118, 152-161 and 120-130 of 
Seq ID No 259; 8-14, 19-41, 52-66, 75-82, 87-92, 106-121, 127-133, 136-143, 158-175, 180-187, 196-204, 221- 
228, 239-245, 259-265, 291-306, 318-323, 328-340, 352-358, 361-368, 375-381, 391-399, 411-418, 431-442, 446- 
455 484-496, 498-510, 527-533, 541-549, 558-565, 575-585, 587-594, 644-655, 661-668, 671-677 and 184-196 of 
Seq ID No 260; 4-22, 29-38, 55-62, 75-81, 102-107, 110-134, 143-150, 161-167, 172-179, 191-215, 223-233, 241- 
247, 251-264, 266-272, 288-309, 340-352, 354-366, 394-402, 414-438 and 198-218 of Seq ID No 261; 24-44, 49- 
70 80-91, 105-118, 128-136, 140-154 and 77-92 of Seq ID No 262; 5-22, 31-36, 41-47, 67-74, 83-90, 105-122, 
135-143, 160-167 and 118-129 of Seq ID No 263; 4-25, 33-73, 81-93, 96-106, 114-120, 122-128, 130-172, 179- 
208, 210-241, 251-283, 296-301 and 92-100 of Seq ID No 264; 14-24, 29-38, 43-50, 52-72, 86-97, 101-107, 110- 
125, 127-141, 145-157, 168-175, 177-184, 186-195, 205-226, 238-250, 255-261, 284-290, 293-304, 307-314, 316- 
323 325-356, 363-371, 383-390, 405-415, 423-432, 442-454, 466-485, 502-511, 519-527, 535-556, 558-565, 569- 
574' 612-634, 641-655, 672-686, 698-709, 715-722, 724-732, 743-753, 760-769, 783-792, 818-825, 830-839, 842- 
849 884-896, 905-918, 926-940, 957-969, 979-1007, 1015-1021, 1049-1057 and 336-349 of Seq ID No 265; 6- 
16 26-31, 33-39, 62-73, 75-85, 87-100, 113-123, 127-152, 157-164, 168-181, 191-198, 208-214, 219-226, 233-254, 
259-266, 286-329 and 181-195 of Seq ID No 266; 4-13, 32-39, 53-76, 99-108, 110-116, 124-135, 137-146, 149- 
157, 162-174, 182-190, 207-231, 242-253, 255-264, 274-283, 291-323, 334-345, 351-360, 375-388, 418-425, 456- 
474 486-492, 508-517, 520-536, 547-560, 562-577, 31-45 and 419-443 of Seq ID No 267; 15-26, 30-37, 42-49, 
58-90, 93-99, 128-134, 147-154, 174-179, 190-197, 199-205, 221-230, 262-274, 277-287, 300-314, 327-333, 343- 
351, 359-377, 388-396, 408-413, 416-425, 431-446 and 246-256 of Seq ID No 268; 5-26, 34-42, 47-54, 61-67, 
71-104, 107-115, 131-138, 144-153, 157-189, 196-202, 204-210, 228-245, 288-309, 316-329, 332-341, 379-386, 
393-399 404-412, 414-421, 457-468, 483-489, 500-506, 508-517, 523-534, 543-557, 565-580, 587-605, 609-617, 
619-627, 631-636, 640-646, 662-668, 675-682, 705-710, 716-723, 727-732, 750-758, 784-789, 795-809, 869-874, 
14-138, 166-286, 372-503, 674-696 and 754-859 of Seq ID No 269; 5-17, 32-38, 40-47, 80-89, 113-119, 125-137, 
140-154, 157-163, 170-177, 185-199, 213-225, 228-236, 242-248, 277-290, 292-305, 323-333, 347-353, 364-370, 
385-394, 399-406, 423-433, 441-451, 462-474, 477-487 and 116-124 of Seq ID No 270; 7-16, 18-30, 32-49, 53- 
61, 63-85, 95-101, 105-115, 119-134, 143-150, 159-178, 185-202, 212-229, 236-250, 254-265, 268-294 and 63-72 
of Seq ID No 271; 4-12, 19-47, 73-81, 97-103, 153-169, 188-198, 207-213, 217-223, 236-242, 255-265, 270-278, 
298-305, 309-317, 335-347, 354-363, 373-394, 419-424, 442-465, 486-492, 500-507, 542-549, 551-558, 560-572, 
580-589, 607-614, 617-623, 647-653, 666-676, 694-704, 706-714, 748-754, 765-772, 786-792, 795-806 and 358- 
370 of Seq ID No 272; 18-28, 30-38, 40-46, 49-55, 69-78, 82-98, 104-134, 147-153, 180-190, 196-202, 218-236, 
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244-261, 266-273, 275-286, 290-295, 301-314, 378-387, 390-395, 427-434 and 290-305 of Seq ID No 273; 4-13, 
20-31, 39-51, 54-61, 69-84, 87-105, 117-124 and 108-125 of Seq ID No 274; 24-34, 43-54, 56-66, 68-79 and 50- 
69 of Seq ID No 275; 5-43, 71-77, 102-131, 141-148, 150-156, 159-186, 191-207, 209-234, 255-268, 280-286, 
293-299, 317-323, 350-357, 363-372, 391-397, 406-418, 428-435, 455-465, 484-497, 499-505, 525-531, 575-582, 
593-607, 621-633, 638-649, 655-673, 684-698, 711-725, 736-741, 743-752, 759-769, 781-793, 813-831, 843-853, 
894-905, 908-916, 929-946, 953-963, 970-978, 1001-1007, 1011-1033, 165-178 and 818-974 of Seq ID No 276; 
16-44, 63-86, 98-108, 185-191, 222-237, 261-274, 282-294, 335-345, 349-362, 374-384, 409-420, 424-430, 440- 
447, 453-460, 465-473, 475-504, 522-534, 538-551, 554-560, 567-582, 598-607, 611-619, 627-640, 643-653, 655- 
661, 669-680, 684-690, 701-707, 715-731, 744-750, 756-763, 768-804, 829-837, 845-853, 855-879, 884-890, 910- 
928, 77-90, 144-212, 279-355, 434-536, 782-810 and 875-902 of Seq ID No 277; 4-22, 29-41, 45-51, 53-66, 70- 
77, 86-95, 98-104, 106-124, 129-135, 142-151, 153-161, 169-176, 228-251, 284-299, 331-337, 339-370, 380-387, 
393-398, 406-411, 423-433, 440-452, 461-469, 488-498, 501-516, 523-530, 532-559, 562-567, 570-602, 612-628, 
630-645, 649-659, 666-672, 677-696, 714-723, 727-747 and 212-227 of Seq ID No 278; 4-9, 17-31, 35-41, 56-61, 
66-75, 81-87, 90-124, 133-138, 149-163, 173-192, 213-219, 221-262, 265-275, 277-282, 292-298, 301-307, 333- 
346, 353-363, 371-378, 419-430, 435-448, 456-469, 551-570, 583-599, 603-612 and 275-291 of Seq ID No 279; 
28-34, 53-58, 72-81, 100-128, 145-154, 159-168, 172-189, 217-225, 227-249, 256-263, 299-309, 322-330, 361-379, 
381-388, 392-401, 404-417, 425-436, 440-446, 451-464, 469-487, 502-511, 543-551, 559-564, 595-601, 606-612, 
615-626, 633-642, 644-650, 664-670, 674-684, 692-701, 715-723, 726-734, 749-756, 763-771, 781-787, 810-843, 
860-869, 882-889, 907-917, 931-936, 941-948, 951-958, 964-971, 976-993, 1039-1049, 1051-1065, 1092-1121, 
1126-1132, 1145-1151, 1158-1173, 1181-1192, 1194-1208, 1218-1223, 1229-1243, 1249-1254, 1265-1279, 1287- 
1297, 1303-1320, 1334-1341, 1343-1358, 1372-1382, 1406-1417, 1419-1425, 1428-1434, 1441-1448, 1460-1473, 
1494-1504, 1509-1514, 1529-1550, 654-669 and 1400-1483 of Seq ID No 280; 10-16, 20-25, 58-65, 97-109, 118- 
132, 134-146, 148-155, 186-195, 226-233, 244-262, 275-284, 295-310, 317-322, 330-339, 345-351, 366-375, 392- 
403, 408-415, 423-430, 435-444, 446-457, 467-479, 486-499, 503-510, 525-537, 540-585, 602-612, 614-623, 625- 
634, 639-645, 650-669, 700-707, 717-724, 727-739, 205-230 and 733-754 of Seq ID No 281; 5-22, 37-43, 72-81, 
105-113, 128-133, 148-160, 188-194, 204-230, 238-245, 251-257 and 194-213 of Seq ID No 282; 16-21, 35-41, 
56-72, 74-92, 103-109 and 62-68 of Seq ID No 283; 4-15, 17-82, 90-104, 107-159, 163-170, 188-221, 234-245, 
252-265 and 220-235 of Seq ID No 284; 16-22, 36-46, 61-75, 92-107, 113-121, 139-145, 148-160 and 30-42 of 
Seq ID No 285; 4-12, 20-26, 43-49, 55-62, 66-78, 121-127, 135-141, 146-161, 164-170, 178-189, 196-205, 233- 
238, 269-279, 288-318, 325-332, 381-386, 400-407 and 328-346 of Seq ID No 286; 5-12, 31-49, 57-63, 69-79, 
89-97, 99-114, 116-127, 134-142, 147-154, 160-173, 185-193, 199-204, 211-222, 229-236, 243-249, 256-274 and 
58-68 of Seq ID No 287; 10-20, 28-34, 39-53, 68-79, 84-90, 99-106 and 73-79 of Seq ID No 288; 14-37, 45-50, 
61-66, 77-82, 93-98, 109-114, 125-130, 141-146, 157-162, 173-178, 189-194, 205-210, 221-226, 237-242, 253-258, 
269-274, 285-290, 301-306, 316-332, 349-359, 371-378, 385-406, 34-307 and 312-385 of Seq ID No 289; 4-10, 
17-38, 50-85, 93-99, 109-116, 128-185, 189-197, 199-210, 223-256, 263-287, 289-312, 327-337, 371-386, 389-394, 
406-419, 424-432, 438-450, 458-463, 475-502, 507-513, 519-526, 535-542, 550-567 and 361-376 of Seq ID No 
290; 10-39, 42-93, 100-144, 155-176, 178-224, 230-244, 246-255, 273-282, 292-301, 308-325, 332-351, 356-361, 
368-379, 386-393, 400-421 and 138-155 of Seq ID No 291; 5-11, 17-34, 40-45, 50-55, 72-80, 101-123, 145-151, 
164-172, 182-187, 189-195, 208-218, 220-241, 243-252, 255-270, 325-331, 365-371, 391-398, 402-418, 422-428, 
430-435, 443-452, 463-469, 476-484, 486-494, 503-509, 529-553, 560-565, 570-590, 608-614, 619-627, 654-661, 
744-750, 772-780, 784-790, 806-816, 836-853, 876-885, 912-918, 926-933, 961-975, 980-987, 996-1006, 1016- 
1028, 1043-1053, 1057-1062, 994-1003 and 1033-1056 of Seq ID No 292; 17-45, 64-71, 73-81, 99-109, 186-192, 
223-238, 262-275, 283-295, 336-346, 350-363, 375-385, 410-421, 425-431, 441-448, 454-463, 468-474, 476-512, 
523-537, 539-552, 568-583, 599-608, 612-620, 628-641, 644-654, 656-662, 670-681, 685-695, 702-708, 716-723, 
725-735, 757-764, 769-798, 800-806, 808-816, 826-840, 846-854, 856-862, 874-881, 885-902, 907-928, 274-350 
and 443-513 of Seq ID No 293; 4-22, 29-41, 45-51, 53-61, 70-76, 85-92, 99-104, 111-122, 134-140, 142-154, 
163-174, 224-232, 255-265, 273-279, 283-297, 330-335, 337-348, 356-367, 373-385, 391-396, 421-431, 442-455, 
475-485, 493-505, 526-538, 544-561, 587-599, 605-620, 622-651, 662-670, 675-681, 687-692, 697-712, 714-735 
and 252-262 of Seq ID No 294; 4-12, 15-35, 40-46, 50-59, 67-94, 110-128, 143-169, 182-188, 207-215, 218-228, 
238-250 and 74-90 of Seq ID No 295; 9-18, 42-58, 78-85, 88-95, 97-106, 115-122, 128-134, 140-145, 154-181, 
186-202, 204-223, 261-267, 269-278, 284-293, 300-336, 358-368 and 12-29 of Seq ID No 296; 7-34, 46-53, 62- 
72, 82-88, 100-105, 111-117, 132-137, 144-160, 166-180, 183-189, 209-221, 231-236, 246-253, 268-282, 286-293, 
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323-336 364-372, 378-392, 422-433 and 388-405 of Seq ID No 297; 21-27, 34-50, 72-77, 80-95, 164-177, 192- 
198 202-220 226-236, 239-247, 270-279, 285-292, 315-320, 327-334, 348-355, 364-371, 388-397, 453-476, 488- 
497 534-545 556-576, 582-588, 601-607, 609-616, 642-662, 674-681, 687-697, 709-715, 721-727, 741-755 and 
621-739 of Seq ID No 298; 4-14, 16-77, 79-109 and 25-99 of Seq ID No 299; 4-9, 17-23, 30-37, 44-55, 65-72, 
77-93, 102-121, 123-132, 146-153 and 17-29 of Seq ID No 300; 4-18, 25-41, 52-60, 83-92, 104-112, 117-123, 
149-155 159-167 170-192, 201-210, 220-227, 245-250 and 124-137 of Seq ID No 301; 8-25, 50-55, 89-95, 138- 
143, 148-153, 159-169, 173-179, 223-238, 262-268, 288-295, 297-308, 325-335, 403-409, 411-417, 432-446, 463- 
475 492-501 524-530, 542-548, 561-574, 576-593, 604-609, 612-622, 637-654, 665-672, 678-685, 720-725, 731- 
739 762-767' 777-783, 820-838, 851-865, 901-908, 913-920, 958-970, 1000-1006, 1009-1015, 1020-1026, 1043- 
1052 1055-1061 1-128, 252-341, 771-793 and 1043-1058 of Seq ID No 302; 16-26, 33-46 and 64-76 of Seq ID 
No 303; 4-27, 69-77, 79-101, 117-123, 126-142, 155-161, 171-186, 200-206, 213-231, 233-244, 267-273, 313-329, 
335-344 347-370 374-379, 399-408, 422-443, 445-453, 461-468, 476-482, 518-534, 544-553, 556-567, 578-595, 
601-620' 626-636, 646-658, 666-681, 715-721, 762-768, 778-785, 789-803, 809-819, 22-108, 153-318, 391-527 
and 638-757 of Seq ID No 304; 6-21, 32-43, 62-92, 104-123, 135-141, 145-152, 199-216, 218-226, 237-247, 260- 
269 274-283, 297-303, 1-72 and 127-211 of Seq ID No 305; 6-26, 50-56, 83-89, 108-114, 123-131, 172-181, 
194-200 221-238, 241-247, 251-259, 263-271, 284-292, 304-319, 321-335, 353-358, 384-391, 408-417, 424-430, 
442-448 459-466, 487-500, 514-528, 541-556, 572-578, 595-601, 605-613, 620-631, 635-648, 660-670, 673-679, 
686-693 702-708, 716-725, 730-735, 749-755, 770-777, 805-811, 831-837, 843-851, 854-860, 863-869, 895-901, 
904-914 922-929 933-938, 947-952, 956-963, 1000-1005, 1008-1014, 1021-1030, 1097-1103, 1120-1130, 1132- 
1140 1-213, 269-592 and 992-1120 of Seq ID No 306; 9-16, 33-39, 47-59, 65-79, 81-95, 103-108, 115-123, 138- 
148 163-171 176-185, 191-196, 205-211, 213-221, 224-256, 261-276, 294-302, 357-363, 384-390, 95-111 and 
161-189 of Seq ID No 307; 21-27, 35-45, 70-76, 92-105, 129-143, 145-155, 161-166, 170-191, 204-211, 214-231, 
234-246 249-255, 259-275 and 1-18 of Seq ID No 308; 21-35, 45-53, 56-64, 69-97 and 1-16 of Seq ID No 309; 
25-33, 41-47, 61-68, 86-101, 106-114, 116-129, 134-142, 144-156, 163-176, 181-190, 228-251, 255-261, 276-292, 
295-305 334-357, 368-380, 395-410, 424-429, 454-460, 469-482, 510-516, 518-527, 531-546, 558-570, 579-606, 
628-636^ 638-645, 651-656, 668-674, 691-698, 717-734, 742-754, 765-770, 792-797, 827-835, 847-859, 874-881, 
903-909 926-933, 942-961, 964-977, 989-1004, 1010-1028, 1031-1047, 1057-1075, 1081-1095, 1108-1117, 1138- 
1144 1182-1189,' 1193-1206, 1220-1229, 1239-1246, 1257-1267, 1271-1279, 1284-1301, 1312-1320, 1329-1335, 
1341-1347 1358-1371, 1399-1404, 1417-1426, 1458-1463, 1468-1476, 1478-1485, 1493-1506, 1535-1541, 1559- 
1574 1583-1590, 1595-1601, 1603-1611, 1622-1628, 1634-1644, 1671-1685, 1689-1696, 1715-1720, 1734-1746, 
1766-1775 1801-1806, 1838-1844, 1858-1871, 1910-1917, 1948-1955, 1960-1974, 2000-2015, 2019-2036, 2041- 
2063 748-847 and 1381-1391 of Seq ID No 310; 5-12, 18-24, 27-53, 56-63, 96-113, 119-124, 131-136, 157-163, 
203-209, 215-223, 233-246, 264-273, 311-316, 380-389, 393-399, 425-433, 445-450, 457-462, 464-470, 475-482, 
507-513, 527-535, 542-548, 550-565, 591-602, 607-613, 627-642, 644-664, 673-712, 714-732, 739-764, 769-782, 
812-818, 826-838, 848-854, 860-871, 892-906, 930-938, 940-954, 957-973, 990-998, 1002-1021, 1024-1033, 1037- 
1042 1050-1060, 1077-1083, 1085-1092, 1100-1129, 1144-1161, 1169-1175, 1178-1189, 1192-1198, 1201-1207, 
1211-1221, 1229-1239, 1250-1270, 1278-1292, 1294-1300, 1314-1335, 1344-1352, 1360-1374, 1394-1405, 1407- 
1414 1416-1424, 1432-1452, 1456-1462, 1474-1497, 1500-1510, 1516-1522, 1534-1542, 1550-1559, 1584-1603, 
1608-1627, 187-273 and 306-441 of Seq ID No 311; 70-80, 90-97, 118-125, 128-140, 142-148, 154-162, 189-202, 
214-222, 224-232, 254-260, 275-313, 317-332, 355-360, 392-398, 425-432, 448-456, 464-470, 476-482, 491-505, 
521-528, 533-546, 560-567, 592-597, 605-614, 618-626, 637-644, 646-653, 660-666, 677-691 and 207-227 of Seq 
ID No 312; 5-19, 26-34, 37-55, 57-66, 69-83, 86-102, 115-134, 138-143, 154-172, 178-195, 209-246, 251-257, 
290-302, 306-311 and 256-266 of Seq ID No 313; 10-20, 22-28, 35-57, 72-79, 87-103, 108-128, 130-144, 158- 
171 190-198, 225-242, 274-291, 301-315, 317-324, 374-385 and 353-365 of Seq ID No 314; 4-9, 17-30, 34-54, 
59-66 73-94,' 118-130, 135-150, 158-171, 189-198, 219-239, 269-275, 283-301, 89-106 and 176-193 of Seq ID 
No 315- 14-20, 22-74, 77-86, 89-99, 104-109, 126-135, 154-165, 181-195, 197-212, 216-224, 264-275 and 107- 
118 of Seq ID No 316; 4-18, 21-38, 63-72, 101-109, 156-162, 165-179, 183-192, 195-210, 212-218, 230-239, 241- 
256, 278-290, 299-311, 313-322, 332-341, 348-366, 386-401, 420-426, 435-450, 455-460, 468-479, 491-498, 510- 
518 532-538, 545-552, 557-563, 567-573, 586-595, 599-609, 620-626, 628-636, 652-657, 665-681 and 1-198 of 
Seq ID No 317; 4-10, 16-38, 51-68, 73-79, 94-115, 120-125, 132-178, 201-208, 216-223, 238-266, 269-295, 297- 
304 337-342, 347-356, 374-401, 403-422, 440-447, 478-504, 510-516, 519-530, 537-544 and 191-206 of Seq ID 
No318; 12-40, 42-48, 66-71, 77-86, 95-102, 113-120, 129-137, 141-148, 155-174, 208-214, 218-225, 234-240, 
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256-267 275-283 300-306, 313-321, 343-350, 359-367, 370-383, 398-405, 432-439, 443-461, 492-508, 516-526, 
528-535 and 370-478 of Seq ID No 319; 6-14, 20-37, 56-62, 90-95, 97-113, 118-125, 140-145, 161-170, 183-202, 
237-244 275-284, 286-305, 309-316, 333-359, 373-401, 405-412 and 176-187 of Seq ID No 320; 33-44, 50-55, 
59-80, 86-101, 129-139, 147-153, 157-163, 171-176, 189-201, 203-224, 239-245, 257-262, 281-287, 290-297, 304- 
320 322-331, 334-350, 372-390, 396-401, 71-88 and 353-372 of Seq ID No 321; 5-11, 15-24, 26-33, 40-47, 75- 
88 95-103, 105-112 and 17-30 of Seq ID No 322; 5-11, 16-39, 46-54, 62-82, 100-107, 111-124, 126-150, 154- 
165 167-183, 204-238, 245-295, 301-313, 316-335 and 8-16 of Seq ID No 323; 4-19, 34-48, 69-74, 79-107, 115- 
127 129-135, 143-153, 160-169, 171-182 and 142-153 of Seq ID No 324; 4-30, 65-74, 82-106, 110-120, 124-132, 
135-140 146-175, 179-184, 190-196, 217-223, 228-233, 250-267, 275-292, 303-315, 322-332 and 174-186 of Seq 
ID No 325; 9-16, 29-41, 47-57, 68-84, 87-109, 113-119, 162-180, 186-193, 195-201, 203-208, 218-230, 234-243, 
265-271 281-292, 305-312, 323-332, 341-347, 349-363, 368-374, 383-390, 396-410, 434-440, 446-452, 455-464, 
466-473, 515-522, 529-542, 565-570, 589-600, 602-613, 618-623, 637-644, 1019-1027, 1238-1244, 1258-1264, 
1268-1276 1281-1292, 1296-1302 and 883-936 of Seq ID No 326; 10-17, 23-32, 39-44, 54-72, 75-81, 88-111, 
138-154, 160-167, 178-185, 201-210, 236-252, 327-334, 336-342, 366-376, 388-400, 410-430, 472-482, 493-526, 
552-558' 586-592, 598-603, 612-621, 630-635, 641-660 and 384-393 of Seq ID No 327; 4-22, 24-39, 50-59, 73- 
84 100-105 111-117, 130-138, 155-161, 173-178, 182-189, 205-215, 266-284, 308-313, 321-328, 330-337, 346- 
363, 368-374, 388-395, 397-405, 426-434, 453-459, 482-492, 501-507, 509-515, 518-523, 527-544, 559-590, 598- 
612 614-629, 646-659, 663-684, 686-694, 698-721 and 445-461 of Seq ID No 328; 14-22, 27-33 and 3-17 of 
Seq ID No 329; 29-41, 66-73, 81-87, 90-108, 140-146, 150-159, 165-184, 186-196, 216-226, 230-238, 247-253, 
261-269 and 126-140 of Seq ID No 330; 5-12, 16-25, 27-33, 36-45, 60-68, 83-88, 103-126 and 86-101 of Seq ID 
No 331- 14-23, 36-47, 56-66, 84-89, 94-105, 111-127, 140-153, 160-174, 176-183, 189-203, 219-225, 231-237, 
250-257 and 194-227 of Seq ID No 332; 4-25, 54-60, 64-71, 73-82, 89-106, 117-124, 157-169, 183-188, 199-210, 
221-232, 236-244, 255-264 and 58-98 of Seq ID No 333; 13-19, 26-36, 41-53, 55-71, 77-84, 86-108, 114-135, 
157-172, 177-183, 187-194, 208-213, 218-226, 110-125 and 156-170 of Seq ID No 334; 5-24, 63-69, 77-85, 94- 
112, 120-137, 140-146, 152-159, 166-172, 179-187, 193-199, 206-212, 222-228, 234-240, 244-252, 257-264, 270- 
289, 298-309, 316-328, 337-348, 363-375, 1-56 and 340-352 of Seq ID No 335; 18-39, 42-71, 78-120, 124-144, 
152-173 179-189, 199-209, 213-222, 228-258, 269-304, 329-361, 364-372, 374-389, 396-441 and 313-327 of Seq 
ID No 336; 19-25, 91-98, 108-120, 156-162, 168-174, 191-204, 211-216, 232-266, 272-278, 286-308, 316-321, 
327-333, 344-355, 358-364, 384-391, 395-428, 464-476, 487-495, 497-511, 544-561, 563-573, 575-582, 588-594, 
10-25 and 322-338 of Seq ID No 337; 14-26, 32-49, 51-57, 59-72, 80-91, 102-112, 119-125, 147-161, 164-173, 
175-183 188-213, 217-222, 246-254, 260-276, 282-303, 308-318, 321-328, 333-350, 352-359, 371-378, 392-401, 
407-414^ 416-443, 448-463, 471-484, 490-497, 501-514, 519-527, 539-551, 557-570, 578-590, 592-598, 600-610, 
618-629, 633-647, 654-667, 676-689, 702-709, 718-726, 728-737, 741-760, 764-780, 786-795, 808-826, 836-842, 
845-852, 865-874, 881-887, 931-945, 949-957, 968-974, 979-986, 1003-1009, 1023-1029 and 90-103 of Seq ID 
No 338; 11-16, 37-56, 60-66, 69-77, 80-88, 93-106, 117-139, 166-171 and 72-90 of Seq ID No 339; 59-84, 123- 
133, 145-150, 161-167, 178-189 and 115-128 of Seq ID No 340; 15-33, 39-46, 52-64, 74-87, 108-124, 127-144, 
150-156, 173-179, 184-194, 201-208, 219-236, 243-269, 272-295, 302-309, 343-349, 356-361, 370-379, 405-411, 
414-423, 430-451, 457-464, 466-475, 477-483, 496-502, 507-522, 541-548, 557-563, 571-577, 579-585, 590-605, 
626-642, 650-662, 671-691, 704-710, 751-769, 775-781, 786-791, 794-829, 851-858, 868-878, 884-904, 913-919, 
931-939 and 132-142 of Seq ID No 341; 33-58, 64-71, 74-80, 83-88, 96-120, 122-139, 146-157, 167-177, 207- 
213, 220-225, 236-242, 264-279, 300-305, 326-336, 340-347, 350-360, 97-115 and 199-211 of Seq ID No 342; 4- 
26, 43-57, 70-99, 102-117, 121-133, 142-148, 151-168, 170-183, 192-220, 235-249, 258-279 and 30-41 of Seq ID 
No 343; 34-42, 48-58, 70-94, 110-130, 154-160, 164-172, 178-183, 195-203, 211-222, 229-250, 256-261, 274-284, 
286-292, 312-323 and 222-233 of Seq ID No 344; 4-9, 15-36, 38-45, 49-74, 78-88, 100-112, 136-191, 211-220, 
226-233, 239-246, 254-274, 287-307, 316-322, 342-353, 356-366, 373-378, 384-393, 405-431, 449-457, 459-468, 
487-511, 515-524, 529-541, 544-552, 562-568, 571-576 and 208-280 of Seq ID No 345; 10-27, 31-37, 39-54, 71- 
108 124-143 and 2-107 of Seq ID No 346; 16-27, 38-57, 64-70, 90-102, 104-113, 116-137, 160-166 and 1-80 of 
Seq ID No 347; 13-21, 31-36, 56-67, 127-136, 153-171, 173-180, 184-200, 214-222, 225-231, 239-263, 267-273 
and 135-159 of Seq ID No 348; 12-27, 31-51, 68-74, 77-87, 94-101, 108-114, 117-123, 127-134, 138-168, 173- 
196, 201-207, 212-217, 227-237, 247-257, 264-280 and 205-223 of Seq ID No 349; 17-22, 25-54, 70-76, 92-100 
and 98-110 of Seq ID No 350; 7-29, 40-50, 60-67, 87-96, 105-111, 119-164, 172-199, 206-212, 220-227, 237- 
259, 272-279, 282-293, 295-309, 313-319, 321-328, 345-363, 376-386 and 159-176 of Seq ID No 351; 4-19, 24- 
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30 36-43 50-68, 71-89, 93-106, 141-152, 154-172, 179-197, 199-215, 229-239, 246-252, 255-263, 281-298, 319- 
325 329-356 358-368, 374-390, 397-409, 420-429, 432-444, 450-456, 459-475, 483-494, 496-502, 520-528, 532- 
556 and 362-377 of Seq ID No 352; 18-25, 40-62, 77-85, 91-97, 105-116, 123-133, 139-184, 189-197 and 122- 
140 of Seq ID No 353; 4-49, 52-58, 62-70, 79-105, 109-133, 142-150, 163-168, 206-214, 220-228, 233-240, 243- 
254 274-281 303-311, 327-338, 357-373, 378-396, 403-413, 420-436, 441-453, 461-467, 475-481, 484-498, 506- 
512 514-521 523-529, 562-579, 589-595, 598-603, 615-648, 714-722, 728-742, 749-758, 777-792, 795-807 and 
643-658 of Seq ID No 354; 8-27, 37-48, 51-56, 72-79, 87-106, 120-138, 140-147, 167-176, 187-197, 205-216, 
222-229 234-239, 243-249, 277-288, 292-315, 334-343, 347-353, 363-391, 398-404, 430-447, 461-467, 478-492, 
498-507 and 456-470 of Seq ID No 355; 5-12, 18-24, 59-69, 80-93, 95-109, 119-125, 130-137, 139-147, 158-163, 
168-176 182-202, 206-215, 222-239, 241-249, 267-277, 291-298, 311-318, 321-327, 338-344, 348-355, 373-386, 
393-406 411-417, 434-443, 446-465, 473-484, 514-521, 532-553, 584-594 and 221-237 of Seq ID No 356; 4-14, 
27-34 50-58 63-72, 79-106, 109-114, 121-142, 146-154, 161-167, 169-175, 178-201, 223-238, 249-254, 259-264, 
278-292 294-312 319-330 and 167-191 of Seq ID No 357; 7-28, 36-42, 50-61, 63-80, 122-152, 161-174, 176- 
191 and 140-190 of Seq ID No 358; 20-57, 59-65, 70-78, 86-102, 119-133, 142-161, 163-173, 177-188, 192-202, 
204-220 222-236, 240-253, 279-319, 326-331, 337-383, 390-399, 406-412, 420-427, 431-438 and 381-395 of Seq 
ID No 359- 13-18, 28-34, 37-43, 50-59, 75-81, 83-97, 105-121, 139-147, 200-206, 209-227, 231-247, 260-271, 
318-327 366-381, 388-394, 399-406 and 182-201 of Seq ID No 360; 6-29, 37-43, 51-56, 70-77, 82-102, 110-119, 
127-143 178-190 201-209, 216-243, 261-269, 281-292, 305-313, 327-339, 341-354, 356-373, 391-397, 423-429, 
438-445 450-478 and 21-314 of Seq ID No 361; 4-12, 15-21, 32-41, 59-76, 80-89, 96-104 and 90-103 of Seq ID 
No 362; 9-28, 30-41, 44-54, 69-74, 77-82, 90-97, 104-123, 125-135, 149-155, 164-173, 177-184, 217-226, 230-235, 
238-244 258-272, 282-297, 300-305, 309-315, 317-322, 327-336, 348-362, 368-374, 380-387, 400-411, 414-424, 
451-458 460-466, 483-494, 497-503, 506-511, 521-528, 540-553, 569-587, 598-606, 628-642, 661-681, 688-700, 
718-733 740-749, 752-764, 769-783, 823-834, 848-854, 862-872, 878-884, 886-898, 915-920, 938-951, 954-961, 
963-972 982-989 996-1003, 1010-1016, 1021-1032, 1038-1044, 1047-1057, 1060-1070, 1079-1088, 1094-1102, 
1117-1127, 1129-1135, 1142-1153, 1158-1204, 1212-1229, 1234-1263, 1269-1277, 1308-1313, 1327-1338, 1344- 
1376 1400-1415 1436-1443, 1448-1458, 1497-1504, 1511-1522, 1544-1566, 3-82 and 509-576 of Seq ID No 
363-8-36 40-64, 71-79, 88-94, 102-109, 118-127, 138-148, 151-159, 163-174, 192-198, 200-206, 220-233, 268- 
273 290-301, 304-309, 316-323, 331-349, 378-391, 414-420, 427-437, 455-475, 494-510, 541-547, 549-555, 616- 
640' 1-60 55-139, 212-308, 386-458 and 458-624 of Seq ID No 364; 16-31, 35-42, 70-77, 91-101, 120-130, 132- 
140' 143-153, 185-190, 195-202, 215-222, 228-238, 241-251, 257-264, 268-277, 288-302, 312-324, 326-333, 341- 
348 364-382, 415-429, 438-454, 458-466, 491-499, 501-521 and 273-281 of Seq ID No 365; 8-14, 32-57, 74-149, 
155-177 179-212, 221-266, 271-296, 304-324, 329-346, 349-359, 368-401, 413-419, 426-454, 465-478, 493-510 
and 466-490 of Seq ID No 366; 22-28, 33-51, 64-89, 96-119, 126-132, 138-146, 152-159, 161-169, 172-179, 193- 
198 205-211 221-231, 235-254, 273-280, 297-303, 312-320, 328-346, 351-373, 378-384, 391-398, 448-454, 460- 
468 470-481, 516-558, 574-593, 597-602, 613-623, 626-646, 649-656, 668-673, 675-683, 696-708, 715-722, 724- 
739 745-751, 759-777, 780-804, 816-822 and 102-113 of Seq ID No 367; 12-28, 41-91, 98-107, 112-120, 125- 
131 151-193 215-221, 240-250, 263-280 and 128-138 of Seq ID No 368; 16-24, 32-38, 46-62, 68-81, 90-105, 
127-133, 144-150, 160-166, 178-184, 186-202, 210-219, 232-240, 252-258, 264-273, 293-324, 337-344, 349-357, 
360-369 385-398, 410-416, 419-427, 441-449, 458-476, 508-515, 523-539, 544-549, 562-569, 571-579, 96-109 
and 127-139 of Seq ID No 369; 19-25, 28-34, 56-61, 85-97, 110-116 and 39-53 of Seq ID No 370; 4-37, 41-50, 
62-72 91-97 99-109, 114-125, 136-141, 149-158, 160-166, 201-215 and 27-225 of Seq ID No 371; 15-31, 44-51, 
96-105, 122-130, 149-157, 162-168, 178-183, 185-192, 198-204, 206-213, 221-234, 239-245, 248-255, 257-266, 
289-335 349-357 415-422, 425-441, 448-454, 462-468 and 463-481 of Seq ID No 372; 5-31, 39-55, 63-72, 76- 
99 106-155 160-177, 179-199, 207-217, 223-240, 245-255, 261-267, 294-316, 321-343, 354-378, 382-452, 477- 
488, 529-536, 555-569, 584-591, 593-612, 620-627, 632-640, 647-654, 671-680, 698-704, 723-730, 732-750, 769- 
775, 781-788, 822-852 and 505-525 of Seq ID No 373; 3-18 of Seq ID No 374; 4-14 and 12-24 of Seq ID No 
375- 4-11, 22-30 and 12-25 of Seq ID No 376; 5-12 and 4-18 of Seq ID No 377; 4-28 and 7-14 of Seq ID No 
378; 6-16 and 8-16 of Seq ID No 379; 4-15, 18-33 and 24-36 of Seq ID No 380; 4-10, 16-21 and 20-31 of Seq 
ID No 381; 6-19 of Seq ID No 382; 11-18 and 3-10 of Seq ID No 383; 13-24 and 3-15 of Seq ID No 384; 15- 
27 and 7-16 of Seq ID No 385; 11-16 and 1-15 of Seq ID No 386; 4-16 and 9-21 of Seq ID No 387; 4-24, 40- 
48, 54-67 and 22-39 of Seq ID No 388; 6-30, 34-55, 62-68, 78-106 and 68-74 of Seq ID No 389; 3-14 of Seq 
ID No 390; 9-19 and 6-21 of Seq ID No 391; 4-17 and 1-9 of Seq ID No 392; 5-30 and 1-8 of Seq ID No 
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393; 4-16, 23-46, 51-56 and 45-55 of Seq ID No 394; 7-16 of Seq ID No 395; 2-14 of Seq ID No 396; 4-36, 43- 
65 and 50-62 of Seq ID No 397; 10-30 and 14-21 of Seq ID No 398; 9-17 and 1-10 of Seq ID No 399; 4-12 
and 3-16 of Seq ID No 400; 4-15 and 5-23 of Seq ID No 401; 10-21 of Seq ID No 402; 6-16 of Seq ID No 
403; 4-29, 31-38 and 2-14 of Seq ID No 404; 4-35 and 33-42 of Seq ID No 405; 2-17 of Seq ID No 406; 9-18, 
30-35 and 15-33 of Seq ID No 407; 4-9 and 6-12 of Seq ID No 408; 3-17 of Seq ID No 409; 12-21, 37-44, 52- 
61, 72-80 and 38-48 of Seq ID No 410; 4-10, 29-44, 54-61, 69-78 and 13-27 of Seq ID No 411; 13-23, 36-53 
and 2-15 of Seq ID No 412; 4-25, 28-46, 56-72, 81-99, 120-132, 134-142, 154-160 and 129-141 of Seq ID No 
413; 4-15, 24-33, 35-41, 64-86 and 21-33 of Seq ID No 414; 9-15 and 4-13 of Seq ID No 415; 4-11, 13-19, 34- 
48 and 15-32 of Seq ID No 416; 4-21 and 11-31 of Seq ID No 417; 23-57 and 38-50 of Seq ID No 418; 4-32 
and 3-13 of Seq ID No 419; 4-10, 13-25, 32-42, 56-68, 72-84 and 26-38 of Seq ID No 420; 4-20, 31-48, 52-58, 
65-71, 80-93, 99-108, 114-123 and 37-49 of Seq ID No 421; 6-12, 14-20 and 3-25 of Seq ID No 422; 14-25, 27- 
38 and 5-14 of Seq ID No 423; 4-41, 57-105, 109-118, 123-136, 144-152 and 86-99 of Seq ID No 424; 6-19 of 
Seq ID No 425; 2-19 of Seq ID No 426; 14-47 and 1-14 of Seq ID No 427; 4-21, 29-44 and 2-18 of Seq ID 
No 428; 23-29 and 10-28 of Seq ID No 429; 6-16, 22-36 and 11-22 of Seq ID No 430; 4-19, 30-44 and 18-27 
of Seq ID No 431; 5-15, 37-45, 58-65 and 38-47 of Seq ID No 432; 4-15, 23-34 and 4-15 of Seq ID No 433; 
30-36, 44-54, 79-85, 101-114, 138-152, 154-164, 170-175, 179-200, 213-220, 223-240, 243-255, 258-264, 268-284 
and 10-28 of Seq ID No 434; the peptides comprising amino acid sequences of column "Identical 
region" of the Table IB, especially peptides comprising amino acid 210-226 and 738-753 of Seq ID No 
449; 326-344, 326-348, 338-354, 371-392, 801-809 and 877-901 of Seq ID No 450; 893-906 of Seq ID No 451; 
51-69 of Seq ID No 452; 110-125 of Seq ID No 453; 291-305 of Seq ID No 454; 210-226 and 738-753 of Seq 
ID No 455; 326-344, 326-348, 338-354, 371-392, 801-809 and 877-901 of Seq ID No 456; 893-906 of Seq ID 
No 457; 51-69 of Seq ID No 458; 110-125 of Seq ID No 459; 291-305 of Seq ID No 460; 32-44 of Seq ID No 
461; 399-410 of Seq ID No 462; the serum reactive epitopes as specified in the column of "aa from" to 
"aa'to" of Table 2, especially peptides comprising amino acid 120-143, 138-161 and 156-179 of Seq ID 
No 218; 110-129 and 168-184 of Seq ID No 219; 74-90 of Seq ID No 222; 759-773 of Seq ID No 223; 237- 
260 of Seq ID No 224; 265-284 of Seq ID No 225; 65-74 of Seq ID No 226; 41-50 of Seq ID No 227; 163-174 
of Seq ID No 229; 26-37 of Seq ID No 230; 174-189 of Seq ID No 232; 240-256 of Seq ID No 234; 285-297 
of Seq ID No 236; 238-247 of Seq ID No 238; 491-519 of Seq ID No 239; 114-140 of Seq ID No 243; 267- 
284 of Seq ID No 250; 439-453 of Seq ID No 252; 162-178 of Seq ID No 253; 347-364 of Seq ID No 254; 
699-715 of Seq ID No 255; 60-71 of Seq ID No 256; 244-257 of Seq ID No 257; 44-63 and 57-76 of Seq ID 
No 258; 185-196 of Seq ID No 260; 119-129 of Seq ID No 263; 182-195 of Seq ID No 266; 32-44 and 424- 
442 of Seq ID No 267; 247-256 of Seq ID No 268; 678-694, 785-805, 55-77 and 72-94 of Seq ID No 269; 210- 
226 of Seq ID No 281; 37-59 of Seq ID No 289; 13-29 of Seq ID No 296; 136-159 of Seq ID No 348; 205-222 
of Seq ID No 349; 99-110 of Seq ID No 350; 160-176 of Seq ID No 351; 457-470 of Seq ID No 355; 221-237 
of Seq ID No 356; 167-190 of Seq ID No 357; 96-120 of Seq ID No 361; 399-417, 503-519 and 544-563 of 
Seq ID No 364; 46-68, 159-183 and 184-198 of Seq ID No 371; 463-481 of Seq ID No 372; the 
immunogenic epitopes as specified in the column of "aa from" to "aa to" of Table 4; especially 
peptides comprising amino acid 110-129 and 168-184 of Seq ID No 219; 877-901, 333-354, 326-344 and 
801-809 of Seq ID No 277; 1-54 of Seq ID No 347; 544-563, 31-51, 107-119, 399-417 and 503-519 of Seq ID 
No 364; 120-198 of Seq ID No 218; 20-35 of Seq ID No 219; 118-201 of Seq ID No 221; 48-132 of Seq ID 
No 242; 118-136 of Seq ID No 249; 162-178 of Seq ID No 253; 347-364 of Seq ID No 254; 699-715 of Seq 
ID No 255; 50-76 of Seq ID No 258; 785-819 and 44-128 of Seq ID No 269; 90-128 of Seq ID No 274; 314- 
384 of Seq ID No 289; 327-349 of Seq ID No 293; 242-314, 405-478 and 23-100 of Seq ID No 304; 129-210 
of Seq ID No 305; 162-188 of Seq ID No 307; 750-772 of Seq ID No 310; 1-56 of Seq ID No 335; 322-337 of 
Seq ID No 337; 72-90 of Seq ID No 339; 374-395 of Seq ID No 345; 136-159 of Seq ID No 348; 141-164 of 
Seq ID No 358; 96-157 of Seq ID No 361; 1-82 of Seq ID No 363; 489-556 of Seq ID No 364; 159-183 and 
49-133 of Seq ID No 371; The peptides comprising amino acid sequences of column "predicted 
immunogenic aa" and "location of identified immunogenic region (aa)" of Table 5, especially peptides 
comprising amino acid 4-26, 35-41, 53-61, 73-84, 103-108, 114-120, 140-146, 156-162, 192-208, 214-219, 227- 
233, 239-252, 260-268, 284-297, 1-48 and 113-133 of Seq ID No 475; 4-27, 38-44, 50-56, 59-64, 72-79, 83-89, 
92-97, 108-116, 123-148, 152-167, 183-196, 200-220, 232-244, 255-261, 265-274, 282-302, 309-317, 1-79 and 
231-302 of Seq ID No 476; 6-28, 66-72, 85-105, 115-121, 144-151, 160-170, 176-185, 223-230, 252-288, 296- 
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310, 319-333, 367-374, 458-464, 471-480, 483-488, 520-528, 530-549, 559-564, 593-601, 606-616, 636-643, 655- 
662, 676-682, 684-699, 719-726, 735-750, 757-764, 777-785, 799-810, 812-843, 846-853, 868-873, 880-889, 891- 
399' 909-929, 934-940, 963-969, 998-1004, 1007-1014, 1016-1022, 1030-1046, 1-80 and 808-821 of Seq ID No 
477; 7-24, 35-41, 75-81, 91-114, 122-132, 137-144, 148-156, 183-192, 194-200, 212-228, 233-238, 251-258, 275- 
295, 326-332, 337-346, 1-79 and 305-321 of Seq ID No 478; 31-38, 42-52, 66-72, 86-92, 98-104, 115-122, 127- 
146, 154-164, 169-187, 198-212, 225-237, 255-269, 13-92 and 135-142 of Seq ID No 479; 4-36, 39-49, 63-69, 
71-77, 81-88, 123-131, 133-139, 160-169, 174-180, 188-194, 210-217, 273-278, 289-300, 317-334, 336-341, 383- 
401, 425-438, 1-68, 212-270 and 402-446 of Seq ID No 480; 21-29, 31-42, 49-63, 72-79, 81-93, 112-132, 159- 
165, 188-195, 197-232, 262-267, 279-286, 294-301, 318-326, 348-366, 381-405, 409-426, 436-465, 471-480, 484- 
492, 497-505, 521-544, 554-561, 567-577, 581-589, 601-609, 611-622, 636-651, 653-667, 669-685, 700-708, 716- 
722', 729-744, 749-766, 780-786, 789-811, 814-864, 1-57 and 84-106 of Seq ID No 481; 6-24, 35-48, 57-63, 72- 
78, 87-92, 113-119, 123-137, 147-153, 173-181, 212-233 and 1-124 of Seq ID No 482; 13-34, 62-69, 78-83, 86- 
9l', 98-104, 107-115, 146-159, 179-188, 195-205, 209-221, 226-233, 239-253, 276-282, 284-294, 297-308, 331-354, 
375-382, 388-399, 421-433, 449-458, 464-469, 472-491, 508-513, 525-531, 534-550, 575-593, 601-618, 629-635, 
654-661, 666-680, 706-721, 723-740, 771-805, 810-830, 845-851 and 1-84 of Seq ID No 483; 4-32, 45-64, 73-83, 
86-92, 100-111, 125-147, 157-163, 170-175, 177-188, 226-232, 245-252, 258-274, 320-335, 348-359 and 1-71 of 
Seq ID No 484; 13-40, 43-71, 76-83, 87-101, 109-119, 125-156, 162-175, 182-219, 226-232, 240-262, 270-287, 
306-318, 326-342, 344-408, 414-444, 449-456 and 1-51 of Seq ID No 485; 4-16, 18-34, 45-54, 99-108, 134-140, 
203-212, 241-257, 266-274, 279-291, 308-315, 330-336, 355-370, 374-382, 402-410, 428-455, 466-472, 474-480, 
531-554, 560-566, 572-580, 597-618, 632-660, 664-674, 676-685, 691-705, 708-735, 750-768, 1-87 and 342-480 
of Seq ID No 486; The serum reactive epitopes as specified in the column of "aa from" to "aa to" of 
Table 6, especially peptides comprising amino acid 115-132 and 1-26 of Seq ID No 475; 33-55 of Seq ID 
No 476; 1-25 of Seq ID No 478; 37-61 of Seq ID No 479; 1-24 of Seq ID No 480; 1-23 of Seq ID No 481; 46- 
60 of Seq ID No 482; 1-28, 23-50 and 45-71 of Seq ID No 483; 1-22 and 17-38 of Seq ID No 484; 1-22 and 
17-38 of Seq ID No 485; 1-27, 22-47 and 422-447 of Seq ID No 486; The immunogenic epitopes as 
specified in the column of "aa from" to "aa to" of Table 7, especially peptides comprising amino acid 
115-132 and 1-47 of Seq ID No 475; 1-55 of Seq ID No 476; 22-85 of Seq ID No 477; 307-320 and 1-44 of 
Seq ID No 478; 15-76 and 40-92 of Seq ID No 479; 1-59, 213-269 and 403-445 of Seq ID No 480; 1-56 and 
85-105 of Seq ID No 481; 37-121 of Seq ID No 482; 1-71 of Seq ID No 483; 1-38 of Seq ID No 484; 1-38 of 
Seq ID No 485; 1-47 of Seq ID No 486. 

The present invention also provides a process for producing a S. agalactiae hyperimmune serum reactive 
antigen or a fragment thereof according to the present invention comprising expressing one or more of 
the nucleic acid molecules according to the present invention in a suitable expression system. 

Moreover, the present invention provides a process for producing a cell, which expresses a S agalactiae 
hyperimmune serum reactive antigen or a fragment thereof according to the present invention 
comprising transforming or transfecting a suitable host cell with the vector according to the present 
invention. 

According to the present invention a pharmaceutical composition, especially a vaccine, comprising a 
hyperimmune serum-reactive antigen or a fragment thereof as defined in the present invention or a 
nucleic acid molecule as defined in the present invention is provided. 

In a preferred embodiment the pharmaceutical composition further comprises an immunostimulatory 
substance, preferably selected from the group comprising polycationic polymers, especially polycationic 
peptides, immunostimulatory deoxynucleotides (ODNs), peptides containing at least two LysLeuLys 
motifs, especially klklsklk, neuroactive compounds, especially human growth hormone, alumn, Freund's 
complete or incomplete adjuvants or combinations thereof. 

In a more preferred embodiment the immunostimulatory substance is a combination of either a 
polycationic polymer and immunostimulatory deoxynucleotides or of a peptide containing at least two 
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LysLeuLys motifs and immunostimulatory deoxy nucleotides. 

In a still more preferred embodiment the polycationic polymer is a polycationic peptide, especially 
polyarginine. 

According to the present invention the use of a nucleic acid molecule according to the present invention 
or a hyperimmune serum-reactive antigen or fragment thereof according to the present invention for the 
manufacture of a pharmaceutical preparation, especially for the manufacture of a vaccine against S. 
agalactiae infection, is provided. 

Also an antibody, or at least an effective part thereof, which binds at least to a selective part of the 
hyperimmune serum-reactive antigen or a fragment thereof according to the present invention, is 
provided herewith. 

In a preferred embodiment the antibody is a monoclonal antibody. 

In another preferred embodiment the effective part of the antibody comprises Fab fragments. 
In a further preferred embodiment the antibody is a chimeric antibody. 
In a still preferred embodiment the antibody is a humanized antibody. 

The present invention also provides a hybridoma cell line, which produces an antibody according to the 
present invention. 

Moreover, the present invention provids a method for producing an antibody according to the present 
invention, characterized by the following steps: 

• initiating an immune response in a non-human animal by administrating an hyperimmune 
serum-reactive antigen or a fragment thereof, as defined in the invention, to said animal, 

• removing an antibody containing body fluid from said animal, and 

• producing the antibody by subjecting said antibody containing body fluid to further 
purification steps. 

Accordingly, the present invention also provides a method for producing an antibody according to the 
present invention, characterized by the following steps: 

• initiating an immune response in a non-human animal by administrating an hyperimmune 
serum-reactive antigen or a fragment thereof, as defined in the present invention, to said animal, 

• removing the spleen or spleen cells from said animal, 

• producing hybridoma cells of said spleen or spleen cells, 

• selecting and cloning hybridoma cells specific for said hyperimmune serum-reactive antigens or a 
fragment thereof, 

• producing the antibody by cultivation of said cloned hybridoma cells and optionally further 
purification steps. 

The antibodies provided or produced according to the above methods may be used for the preparation of 
a medicament for treating or preventing S. agalactiae infections. 

According to another aspect the present invention provides an antagonist, which binds to a 
hyperimmune serum-reactive antigen or a fragment thereof according to the present invention. 

Such an antagonist capable of binding to a hyperimmune serum-reactive antigen or fragment thereof 
according to the present invention may be identified by a method comprising the following steps: 
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a) contacting an isolated or immobilized hyperimmune serum-reactive antigen or a fragment 
thereof according to the present invention with a candidate antagonist under conditions to 
permit binding of said candidate antagonist to said hyperimmune serum-reactive antigen or 
fragment, in the presence of a component capable of providing a detectable signal in response to 
the binding of the candidate antagonist to said hyperimmune serum reactive antigen or fragment 
thereof; and 

b) detecting the presence or absence of a signal generated in response to the binding of the 
antagonist to the hyperimmune serum reactive antigen or the fragment thereof. 

An antagonist capable of reducing or inhibiting the interaction activity of a hyperimmune serum-reactive 
antigen or a fragment thereof according to the present invention to its interaction partner may be 
identified by a method comprising the following steps: 

a) providing a hyperimmune serum reactive antigen or a hyperimmune fragment thereof according 
to the present invention, 

b) providing an interaction partner to said hyperimmune serum reactive antigen or a fragment 
thereof, especially an antibody according to the present invention, 

c) allowing interaction of said hyperimmune serum reactive antigen or fragment thereof to said 
interaction partner to form an interaction complex, 

d) providing a candidate antagonist, 

e) allowing a competition reaction to occur between the candidate antagonist and the interaction 
complex , 

f) determining whether the candidate antagonist inhibits or reduces the interaction activities of the 
hyperimmune serum reactive antigen or the fragment thereof with the interaction partner. 

The hyperimmune serum reactive antigens or fragments thereof according to the present invention may 
be used for the isolation and/or purification and/or identification of an interaction partner of said 
hyperimmune serum reactive antigen or fragment thereof. 

The present invention also provides a process for in vitro diagnosing a disease related to expression of a 
hyperimmune serum-reactive antigen or a fragment thereof according to the present invention 
comprising determining the presence of a nucleic acid sequence encoding said hyperimmune serum 
reactive antigen or fragment thereof according to the present invention or the presence of the 
hyperimmune serum reactive antigen or fragment thereof according to the present invention. 

The present invention also provides a process for in vitro diagnosis of a bacterial infection, especially a S. 
agalactiae infection, comprising analyzing for the presence of a nucleic acid sequence encoding said 
hyperimmune serum reactive antigen or fragment thereof according to the present invention or the 
presence of the hyperimmune serum reactive antigen or fragment thereof according to the present 
invention. 

Moreover, the present invention provides the use of a hyperimmune serum reactive antigen or fragment 
thereof according to the present invention for the generation of a peptide binding to said hyperimmune 
serum reactive antigen or fragment thereof, wherein the peptide is an anticaline. 

The present invention also provides the use of a hyperimmune serum-reactive antigen or fragment 
thereof according to the present invention for the manufacture of a functional nucleic acid, wherein the 
functional nucleic acid is selected from the group comprising aptamers and spiegelmers. 

The nucleic acid molecule according to the present invention may also be used for the manufacture of a 
functional ribonucleic acid, wherein the functional ribonucleic acid is selected from the group comprising 
ribozymes, antisense nucleic acids and siRNA. 
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The present invention advantageously provides an efficient, relevant and comprehensive set of isolated 
nucleic acid molecules and their encoded hyperimmune serum reactive antigens or fragments thereof 
identified from S. agalactiae using an antibody preparation from multiple human plasma pools and 
surface expression libraries derived from the genome of S. agalactiae. Thus, the present invention fulfils a 
widely felt demand for S. agalactiae antigens, vaccines, diagnostics and products useful in procedures for 
preparing antibodies and for identifying compounds effective against S. agalactiae infection. 

An effective vaccine should be composed of proteins or polypeptides, which are expressed by all strains 
and are able to induce high affinity, abundant antibodies against cell surface components of S. agalactiae. 
The antibodies should be IgGl and/or IgG3 for opsonization, and any IgG subtype and IgA for 
neutralisation of adherence and toxin action. A chemically defined vaccine must be definitely superior 
compared to a whole cell vaccine (attenuated or killed), since components of S. agalactiae, which cross- 
react with human tissues or inhibit opsonization can be eliminated, and the individual proteins inducing 
protective antibodies and/or a protective immune response can be selected. 

The approach, which has been employed for the present invention, is based on the interaction of GBS 
proteins or peptides with the antibodies present in human sera. The antibodies produced against S. 
agalactiae by the human immune system and present in human sera are indicative of the in vivo 
expression of the antigenic proteins and their immunogenicity. In addition, the antigenic proteins as 
identified by the bacterial surface display expression libraries using pools of pre-selected sera, are 
processed in a second and third round of screening by individual selected or generated sera. Thus the 
present invention supplies an efficient, relevant, comprehensive set of GBS antigens as a pharmaceutical 
composition, especially a vaccine preventing infection by S. agalactiae. 

In the antigen identification program for identifying a comprehensive set of antigens according to the 
present invention, at least two different bacterial surface expression libraries are screened with several 
serum pools or plasma fractions or other pooled antibody containing body fluids (antibody pools). The 
antibody pools are derived from a serum collection, which has been tested against antigenic compounds 
of S. agalactiae, such as whole cell extracts and culture supernatant proteins. Preferably, three distinct 
serum collections are used, obtained ad 1. from healthy pregnant women tested negative for cervical and 
anorectal carriage of GBS, ad 2. healthy pregnant women tested positive for cervical and/or anorectal 
carriage of GBS who's newborn remained GBS-free (although with antibiotic prevention), ad 3. from 
adults below <45 years of age without clinical disease. Sera have to react with multiple GBS-specific 
antigens in order to be considered hyperimmune and therefore relevant in the screening method applied 
for the present invention. 

The expression libraries as used in the present invention should allow expression of all potential antigens, 
e.g. derived from all secreted and surface proteins of S. agalactiae. Bacterial surface display libraries will 
be represented by a recombinant library of a bacterial host displaying a (total) set of expressed peptide 
sequences of S. agalactiae on two selected outer membrane proteins (LamB and FhuA) at the bacterial host 
membrane {Georgiou, G., 1997}; {Etz, H. et al., 2001}. One of the advantages of using recombinant 
expression libraries is that the identified hyperimmune serum-reactive antigens may be instantly 
produced by expression of the coding sequences of the screened and selected clones expressing the 
hyperimmune serum-reactive antigens without further recombinant DNA technology or cloning steps 
necessary. 

The comprehensive set of antigens identified by the described program according to the present 
invention is analysed further by one or more additional rounds of screening. Therefore individual 
antibody preparations or antibodies generated against selected peptides, which were identified as 
immunogenic are used. According to a preferred embodiment the individual antibody preparations for 
the second round of screening are derived from pregnant women and non-pregant adults who show an 
antibody titer above a certain minimum level, for example an antibody titer being higher than 80 
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percentile, preferably higher than 90 percentile, especially higher than 95 percentile of the human (patient 
or healthy individual) sera tested. Using such high titer individual antibody preparations in the second 
screening round allows a very selective identification of the hyperimmune serum-reactive antigens and 
fragments thereof from S. agalactiae. 

Following the comprehensive screening procedure, the selected antigenic proteins, expressed as 
recombinant proteins or in vitro translated products, in case it can not be expressed in prokaryotic 
expression systems, or the identified antigenic peptides (produced synthetically) are tested in a second 
screening by a series of ELISA and Western blotting assays for the assessment of their immunogenicity 
with a large human serum collection (minimum -150 healthy and patients sera). 

It is important that the individual antibody preparations (which may also be the selected serum) allow a 
selective identification of the most promising candidates of all the hyperimmune serum-reactive antigens 
from all the promising candidates from the first round. Therefore, preferably at least 10 individual 
antibody preparations (i.e. antibody preparations (e.g. sera) from at least 10 different individuals having 
suffered from an infection to the chosen pathogen) should be used in identifying these antigens in the 
second screening round. Of course, it is possible to use also less than 10 individual preparations, 
however, selectivity of the step may not be optimal with a low number of individual antibody 
preparations. On the other hand, if a given hyperimmune serum-reactive antigen (or an antigenic 
fragment thereof) is recognized by at least 10 individual antibody preparations, preferably at least 30, 
especially at least 50 individual antibody preparations, identification of the hyperimmune serum-reactive 
antigen is also selective enough for a proper identification. Hyperimmune serum-reactivity may of course 
be tested with as many individual preparations as possible (e.g. with more than 100 or even with more 
than 1,000). 

Therefore, the relevant portion of the hyperimmune serum-reactive antibody preparations according to 
the method of the present invention should preferably be at least 10, more preferred at least 30, especially 
at least 50 individual antibody preparations. Alternatively (or in combination) hyperimmune serum- 
reactive antigens may preferably be also identified with at least 20%, preferably at least 30%, especially at 
least 40% of all individual antibody preparations used in the second screening round. 

According to a preferred embodiment of the present invention, the sera from which the individual 
antibody preparations for the second round of screening are prepared (or which are used as antibody 
preparations), are selected by their titer against S. agalactiae (e.g. against a preparation of this pathogen, 
such as a lysate, cell wall components and recombinant proteins). Preferably, some are selected with a 
total IgA titer above 300 U, especially above 500 U, and/or an IgG titer above 5,000 U, especially above 
10,000 U (U = units, calculated from the OD 4 o 5 nm reading at a given dilution) when the whole organism 
(total lysate or whole cells) is used as antigen in the ELISA. 

The antibodies produced against streptococci by the human immune system and present in human sera 
are indicative of the in vivo expression of the antigenic proteins and their immunogenicity. The 
recognition of linear epitopes recognized by serum antibodies can be based on sequences as short as 4-5 
amino acids. Of course it does not necessarily mean that these short peptides are capable of inducing the 
given antibody in vivo. For that reason the defined epitopes, polypeptides and proteins are further to be 
tested in animals (mainly in mice) for their capacity to induce antibodies against the selected proteins in 
vivo. 

The preferred antigens are located on the cell surface or secreted, and are therefore accessible 
extracellularly. Antibodies against cell wall proteins are expected to serve multiple purposes: to inhibit 
adhesion, to interfere with nutrient acquisition, to inhibit immune evasion nand to promote phagocytosis 
{Hornef, M. et al., 2002}. Antibodies against secreted proteins are beneficial in neutralisation of their 
function as toxin or virulence component. It is also known that bacteria communicate with each other 
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through secreted proteins. Neutralizing antibodies against these proteins will interrupt growth- 
promoting cross-talk between or within streptococcal species. Bioinformatic analyses (signal sequences, 
cell wall localisation signals, transmembrane domains) proved to be very useful in assessing cell surface 
localisation or secretion. The experimental approach includes the isolation of antibodies with the 
corresponding epitopes and proteins from human serum, and the generation of immune sera in mice 
against (poly) peptides selected by the bacterial surface display screens. These sera are then used in a 
third round of screening as reagents in the following assays: cell surface staining of S. agalactia* grown 
under different conditions (FACS or microscopy), determination of neutralizing capacity (toxin, 
adherence), and promotion of opsonization and phagocytosis (in vitro phagocytosis assay). 

For that purpose, bacterial E, coli clones are directly injected into mice and immune sera are taken and 
tested in the relevant in vitro assay for functional opsonic or neutralizing antibodies. Alternatively, 
specific antibodies may be purified from human or mouse sera using peptides or proteins as substrate. 

Host defence against S. agalactiae relies mainly on opsonophagocytic killing mechanism. Inducing high 
affinity antibodies of the opsonic and neutralizing type by vaccination helps the innate immune system to 
eliminate bacteria and toxins. This makes the method according to the present invention an optimal tool 
for the identification of GBS antigenic proteins. 

The skin and mucous membranes are formidable barriers against invasion by streptococci. However, 
once the skin or the mucous membranes are breached the first line of non-adaptive cellular defence 
begins its co-ordinate action through complement and phagocytes, especially the polymorphonuclear 
leukocytes (PMNs). These cells can be regarded as the cornerstones in eliminating invading bacteria. As 
Streptococcus agalactiae is a primarily extracellular pathogen, the major anti-streptococcal adaptive 
response comes from the humoral arm of the immune system, and is mediated through three major 
mechanisms: promotion of opsonization, toxin neutralisation, and inhibition of adherence. It is believed 
that opsonization is especially important, because of its requirement for an effective phagocytosis. For 
efficient opsonization the microbial surface has to be coated with antibodies and complement factors for 
recognition by PMNs through receptors to the Fc fragment of the IgG molecule or to activated C3b. After 
opsonization, streptococci are phagocytosed and killed. Antibodies bound to specific antigens on the cell 
surface of bacteria serve as ligands for the attachment to PMNs and to promote phagocytosis. The very 
same antibodies bound to the adhesins and other cell surface proteins are expected to neutralize adhesion 
and prevent colonization. The selection of antigens as provided by the present invention is thus well 
suited to identify those that will lead to protection against infection in an animal model or in humans. 

According to the antigen identification method used herein, the present invention can surprisingly 
provide a set of comprehensive novel nucleic acids and novel hyperimmune serum reactive antigens and 
fragments thereof of S. agalactiae, among other things, as described below. According to one aspect, the 
invention particularly relates to the nucleotide sequences encoding hyperimmune serum reactive 
antigens which sequences are set forth in the Sequence listing Seq ID No: 1-217, 435-448 and 463-474 and 
the corresponding encoded amino acid sequences representing hyperimmune serum reactive antigens are 
set forth in the Sequence Listing Seq ID No 218-434, 449-462 and 475-486. 

In a preferred embodiment of the present invention, a nucleic acid molecule is provided which exhibits 
70% identity over their entire length to a nucleotide sequence set forth with Seq ID No 14, 90, 157-216.. 
Most highly preferred are nucleic acids that comprise a region that is at least 80% or at least 85% identical 
over their entire length to a nucleic acid molecule set forth with Seq ID No 14, 90, 157-216. In this regard, 
nucleic acid molecules at least 90%, 91%, 92%, 93%, 94%, 95%, or 96% identical over their entire length to 
the same are particularly preferred. Furthermore, those with at least 97% are highly preferred, those with 
at least 98% and at least 99% are particularly highly preferred, with at least 99% or 99.5% being the more 
preferred, with 100% identity being especially preferred. Moreover, preferred embodiments in this 
respect are nucleic acids which encode hyperimmune serum reactive antigens or fragments thereof 
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(polypeptides) which retain substantially the same biological function or activity as the mature 
polypeptide encoded by said nucleic acids set forth in the Seq ID No 14, 90, 157-216. 

Identity, as known in the art and used herein, is the relationship between two or more polypeptide 
sequences or two or more polynucleotide sequences, as determined by comparing the sequences. In the 
art, identity also means the degree of sequence relatedness between polypeptide or polynucleotide 
sequences, as the case may be, as determined by the match between strings of such sequences. Identity 
can be readily calculated. While there exist a number of methods to measure identity between two 
polynucleotide or two polypeptide sequences, the term is well known to skilled artisans (e.g. Sequence 
Analysis in Molecular Biology, von Heinje, G., Academic Press, 1987). Preferred methods to determine 
identity are designed to give the largest match between the sequences tested. Methods to determine 
identity are codified in computer programs. Preferred computer program methods to determine identity 
between two sequences include, but are not limited to, GCG program package {Devereux, J. et al., 1984}, 
BLASTP, BLASTN, and FASTA {Altschul, S. et al., 1990}. 

According to another aspect of the invention, nucleic acid molecules are provided which exhibit 96 % or 
more than 96 %, especially 100 % sequence identity to the nucleic acid sequence set forth with Seq ID No 
1, 3, 5-13, 15, 18-25, 27-31, 33-36, 39-68, 70-85, 92-100, 103-126, 128-145, 147, 149-156, 217, 435-448 and 463- 
474. 

According to a further aspect of the present invention, nucleic acid molecules having 98 % or more than 
98%, escpecially 100 % sequence identity to a nucleic acid molecule are provided which are identical to 
the nucleic acid sequences set forth with Seq ID No 32, 86, 91, 101, 127. 

The nucleic acid molecules according to the present invention can as a second alternative also be a nucleic 
acid molecule which is at least essentially complementary to the nucleic acid described as the first 
alternative above. As used herein complementary means that a nucleic acid strand is base pairing via 
Watson-Crick base pairing with a second nucleic acid strand. Essentially complementary as used herein 
means that the base pairing is not occurring for all of the bases of the respective strands but leaves a 
certain number or percentage of the bases unpaired or wrongly paired. The percentage of correctly 
pairing bases is preferably at least 70 %, more preferably 80 %, even more preferably 90 % and most 
preferably any percentage higher than 90 %. It is to be noted that a percentage of 70 % matching bases is 
considered as homology and the hybridization having this extent of matching base pairs is considered as 
stringent. Hybridization conditions for this kind of stringent hybridization may be taken from Current 
Protocols in Molecular Biology (John Wiley and Sons, Inc., 1987). More particularly, the hybridization 
conditions can be as follows: 

• Hybridization performed e.g. in 5 x SSPE, 5 x Denhardt's reagent, 0.1% SDS, 100 g/mL sheared 
DNA at 68°C 

• Moderate stringency wash in 0.2xSSC, 0.1% SDS at 42°C 

• High stringency wash in O.lxSSC, 0.1% SDS at 68°C 

Genomic DNA with a GC content of 50% has an approximate Tm of 96°C. For 1% mismatch, the Tm is 
reduced by approximately 1°C. 

In addition, any of the further hybridization conditions described herein are in principle applicable as 
well. 

Of course, all nucleic acid sequence molecules which encode the same polypeptide molecule as those 
identified by the present invention are encompassed by any disclosure of a given coding sequence, since 
the degeneracy of the genetic code is directly applicable to unambiguously determine all possible nucleic 
acid molecules which encode a given polypeptide molecule, even if the number of such degenerated 
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nucleic acid molecules may be high. This is also applicable for fragments of a given polypeptide, as long 
as the fragments encode a polypeptide being suitable to be used in a vaccination connection, e.g. as an 
active or passive vaccine. 

The nucleic acid molecule according to the present invention can as a third alternative also be a nucleic 
acid which comprises a stretch of at least 15 bases of the nucleic acid molecule according to the first and 
second alternative of the nucleic acid molecules according to the present invention as outlined above. 
Preferably, the bases form a contiguous stretch of bases. However, it is also within the scope of the 
present invention that the stretch consists of two or more moieties, which are separated by a number of 
bases. 

The present nucleic acids may preferably consist of at least 20, even more preferred at least 30, especially 
at least 50 contiguous bases from the sequences disclosed herein. The suitable length may easily be 
optimized due to the planned area of use (e.g. as (PGR) primers, probes, capture molecules (e.g. on 
a (DNA) chip), etc.). Preferred nucleic acid molecules contain at least a contiguous 15 base portion of one 
or more of the predicted immunogenic amino acid sequences listed in tables 1 and 2, especially the 
sequences of table 2 with scores of more than 10, preferably more than 20, especially with a score of more 
than 25. Specifically preferred are nucleic acids containing a contiguous portion of a DNA sequence of 
any sequence in the sequence protocol of the present application which shows 1 or more, preferably more 
than 2, especially more than 5, non-identical nucleic acid residues compared to the published 
Streptococcus agalactiae strain NEM316 (ATCC 12403) genome ({Glaser, P. et al., 2002}; GenBank accession 
AL732656) and/or any other published S. agalactiae genome sequence or parts thereof, especially of the 
serotype V 2603 V/R (A909) strain {Tettelin, H. et al., 2002}); GenBank accession AE009948). Specifically 
preferred non-identical nucleic acid residues are residues, which lead to a non-identical amino acid 
residue. Preferably, the nucleic acid sequences encode polypeptides having at least 1, preferably at least 2, 
preferably at least 3 different amino acid residues compared to the published S. agalactiae counterparts 
mentioned above. Also such isolated polypeptides, being fragments of the proteins (or the whole protein) 
mentioned herein e.g. in the sequence listing, having at least 6, 7, or 8 amino acid residues and being 
encoded by these nucleic acids are preferred. 

The nucleic acid molecule according to the present invention can as a fourth alternative also be a nucleic 
acid molecule which anneals under stringent hybridisation conditions to any of the nucleic acids of the 
present invention according to the above outlined first, second, and third alternative. Stringent 
hybridisation conditions are typically those described herein. 

Finally, the nucleic acid molecule according to the present invention can as a fifth alternative also be a 
nucleic acid molecule which, but for the degeneracy of the genetic code, would hybridise to any of the 
nucleic acid molecules according to any nucleic acid molecule of the present invention according to the 
first, second, third, and fourth alternative as outlined above. This kind of nucleic acid molecule refers to 
the fact that preferably the nucleic acids according to the present invention code for the hyperimmune 
serum reactive antigens or fragments thereof according to the present invention. This kind of nucleic acid 
molecule is particularly useful in the detection of a nucleic acid molecule according to the present 
invention and thus the diagnosis of the respective microorganisms such as S. agalactiae and any disease or 
diseased condition where this kind of microorganims is involved. Preferably, the hybridisation would 
occur or be preformed under stringent conditions as described in connection with the fourth alternative 
described above. 

Nucleic acid molecule as used herein generally refers to any ribonucleic acid molecule or 
deoxyribonucleic acid molecule, which may be unmodified RNA or DNA or modified RNA or DNA. 
Thus, for instance, nucleic acid molecule as used herein refers to, among other, single-and double- 
stranded DNA, DNA that is a mixture of single- and double-stranded RNA, and RNA that is a mixture of 
single- and double-stranded regions, hybrid molecules comprising DNA and RNA that may be single- 
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stranded or, more typically, double-stranded, or triple-stranded, or a mixture of single- and double- 
stranded regions. In addition, nucleic acid molecule as used herein refers to triple-stranded regions 
comprising RNA or DNA or both RNA and DNA. The strands in such regions may be from the same 
molecule or from different molecules. The regions may include all of one or more of the molecules, but 
more typically involve only a region of some of the molecules. One of the molecules of a triple-helical 
region often is an oligonucleotide. As used herein, the term nucleic acid molecule includes DNAs or 
RNAs as described above that contain one or more modified bases. Thus, DNAs or RNAs with backbones 
modified for stability or for other reasons are "nucleic acid molecule" as that term is intended herein. 
Moreover, DNAs or RNAs comprising unusual bases, such as inosine, or modified bases, such as 
tritylated bases, to name just two examples, are nucleic acid molecule as the term is used herein. It will be 
appreciated that a great variety of modifications have been made to DNA and RNA that serve many 
useful purposes known to those of skill in the art. The term nucleic acid molecule as it is employed herein 
embraces such chemically, enzymatically or metabolically modified forms of nucleic acid molecule, as 
well as the chemical forms of DNA and RNA characteristic of viruses and cells, including simple and 
complex cells, inter alia. The term nucleic acid molecule also embraces short nucleic acid molecules often 
referred to as oligonucleotide(s). "Polynucleotide" and "nucleic acid" or "nucleic acid molecule" are often 
used interchangeably herein. 

Nucleic acid molecules provided in the present invention also encompass numerous unique fragments, 
both longer and shorter than the nucleic acid molecule sequences set forth in the sequencing listing of the 
S. agalactiae coding regions, which can be generated by standard cloning methods. To be unique, a 
fragment must be of sufficient size to distinguish it from other known nucleic acid sequences, most 
readily determined by comparing any selected S. agalactiae fragment to the nucleotide sequences in 
computer databases such as GenBank. 

Additionally, modifications can be made to the nucleic acid molecules and polypeptides that are 
encompassed by the present invention. For example, nucleotide substitutions can be made which do not 
affect the polypeptide encoded by the nucleic acid, and thus any nucleic acid molecule which encodes a 
hyperimmune serum reactive antigen or fragments thereof is encompassed by the present invention. 

Furthermore, any of the nucleic acid molecules encoding hyperimmune serum reactive antigens or 
fragments thereof provided by the present invention can be functionally linked, using standard 
techniques such as standard cloning techniques, to any desired regulatory sequences, whether a S. 
agalactiae regulatory sequence or a heterologous regulatory sequence, heterologous leader sequence, 
heterologous marker sequence or a heterologous coding sequence to create a fusion protein. 

Nucleic acid molecules of the present invention may be in the form of RNA, such as mRNA or cRNA, or 
in the form of DNA, including, for instance, cDNA and genomic DNA obtained by cloning or produced 
by chemical synthetic techniques or by a combination thereof. The DNA may be triple-stranded, double- 
stranded or single-stranded. Single-stranded DNA may be the coding strand, also known as the sense 
strand, or it may be the non-coding strand, also referred to as the anti-sense strand. 

The present invention further relates to variants of the herein above described nucleic acid molecules 
which encode fragments, analogs and derivatives of the hyperimmune serum reactive antigens and 
fragments thereof having a deducted S. agalactiae amino acid sequence set forth in the Sequence Listing. A 
variant of the nucleic acid molecule may be a naturally occurring variant such as a naturally occurring 
allelic variant, or it may be a variant that is not known to occur naturally. Such non-naturally occurring 
variants of the nucleic acid molecule may be made by mutagenesis techniques, including those applied to 
nucleic acid molecules, cells or organisms. 

Among variants in this regard are variants that differ from the aforementioned nucleic acid molecules by 
nucleotide substitutions, deletions or additions. The substitutions, deletions or additions may involve one 
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or more nucleotides. The variants may be altered in coding or non-coding regions or both. Alterations in 
the coding regions may produce conservative or non-conservative amino acid substitutions, deletions or 
additions. Preferred are nucleic acid molecules encoding a variant, analog, derivative or fragment, or a 
variant, analogue or derivative of a fragment, which have a S. agalactiae sequence as set forth in the 
Sequence Listing, in which several, a few, 5 to 10, 1 to 5, 1 to 3, 2, 1 or no amino acid(s) is substituted, 
deleted or added, in any combination. Especially preferred among these are silent substitutions, additions 
and deletions, which do not alter the properties and activities of the S. agalactiae polypeptides set forth in 
the Sequence Listing. Also especially preferred in this regard are conservative substitutions. 

The peptides and fragments according to the present invention also include modified epitopes wherein 
preferably one or two of the amino acids of a given epitope are modified or replaced according to the 
rules disclosed in e.g. {Tourdot, S. et al v 2000}, as well as the nucleic acid sequences encoding such 
modified epitopes. 

It is clear that also epitopes derived from the present epitopes by amino acid exchanges improving, 
conserving or at least not significantly impeding the T cell activating capability of the epitopes are 
covered by the epitopes according to the present invention. Therefore the present epitopes also cover 
epitopes, which do not contain the original sequence as derived from S. agalactiae, but trigger the same or 
preferably an improved T cell response. These epitope are referred to as "heteroclitic"; they need to have a 
similar or preferably greater affinity to MHC/HLA molecules, and the need the ability to stimulate the T 
cell receptors (TCR) directed to the original epitope in a similar or preferably stronger manner. 

Heteroclitic epitopes can be obtained by rational design i.e. taking into account the contribution of 
individual residues to binding to MHC/HLA as for instance described by {Rammensee, H. et al., 1999}, 
combined with a systematic exchange of residues potentially interacting with the TCR and testing the 
resulting sequences with T cells directed against the original epitope. Such a design is possible for a 
skilled man in the art without much experimentation. 

Another possibility includes the screening of peptide libraries with T cells directed against the original 
epitope. A preferred way is the positional scanning of synthetic peptide libraries. Such approaches have 
been described in detail for instance by {Hemmer, B. et al., 1999}and the references given therein. 

As an alternative to epitopes represented by the present derived amino acid sequences or heteroclitic 
epitopes, also substances mimicking these epitopes e.g. "peptidemimetica" or "retro-inverso-peptides" can 
be applied. 

Another aspect of the design of improved epitopes is their formulation or modification with substances 
increasing their capacity to stimulate T cells. These include T helper cell epitopes, lipids or liposomes or 
preferred modifications as described in WO 01/78767. 

Another way to increase the T cell stimulating capacity of epitopes is their formulation with immune 
stimulating substances for instance cytokines or chemokines like interleukin-2, -7, -12, -18, class I and II 
interferons (IFN), especially IFN-gamma, GM-CSF, TNF-alpha, flt3-ligand and others. 

As discussed additionally herein regarding nucleic acid molecule assays of the invention, for instance, 
nucleic acid molecules of the invention as discussed above, may be used as a hybridization probe for 
RNA, cDNA and genomic DNA to isolate full-length cDNAs and genomic clones encoding polypeptides 
of the present invention and to isolate cDNA and genomic clones of other genes that have a high 
sequence similarity to the nucleic acid molecules of the present invention. Such probes generally will 
comprise at least 15 bases. Preferably, such probes will have at least 20, at least 25 or at least 30 bases, and 
may have at least 50 bases. Particularly preferred probes will have at least 30 bases, and will have 50 
bases or less, such as 30, 35, 40, 45, or 50 bases. 
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For example, the coding region of a nucleic acid molecule of the present invention may be isolated by 
screening a relevant library using the known DNA sequence to synthesize an oligonucleotide probe. A 
labeled oligonucleotide having a sequence complementary to that of a gene of the present invention is 
then used to screen a library of cDNA, genomic DNA or mRNA to determine to which members of the 
library the probe hybridizes. 

The nucleic acid molecules and polypeptides of the present invention may be employed as reagents and 
materials for development of treatments of and diagnostics for disease, particularly human disease, as 
further discussed herein relating to nucleic acid molecule assays, inter alia. 

The nucleic acid molecules of the present invention that are oligonucleotides can be used in the processes 
herein as described, but preferably for PGR, to determine whether or not the S. agalactiae genes identified 
herein in whole or in part are present and/or transcribed in infected tissue such as blood. It is recognized 
that such sequences will also have utility in diagnosis of the stage of infection and type of infection the 
pathogen has attained. For this and other purposes the arrays comprising at least one of the nucleic acids 
according to the present invention as described herein, may be used. 

The nucleic acid molecules according to the present invention may be used for the detection of nucleic 
acid molecules and organisms or samples containing these nucleic acids. Preferably such detection is for 
diagnosis, more preferable for the diagnosis of a disease related or linked to the present or abundance of 
S. agalactiae. 

Eukaryotes (herein also "individual(s)"), particularly mammals, and especially humans, infected with S. 
agalactiae may be identifiable by detecting any of the nucleic acid molecules according to the present 
invention detected at the DNA level by a variety of techniques. Preferred nucleic acid molecules 
candidates for distinguishing a S. agalactiae from other organisms can be obtained. 

The invention provides a process for diagnosing disease, arising from infection with S. agalactiae, 
comprising determining from a sample isolated or derived from an individual an increased level of 
expression of a nucleic acid molecule having the sequence of a nucleic acid molecule set forth in the 
Sequence Listing. Expression of nucleic acid molecules can be measured using any one of the methods 
well known in the art for the quantitation of nucleic acid molecules, such as, for example, PCR, RT-PCR, 
Rnase protection, Northern blotting, other hybridisation methods and the arrays described herein. 

Isolated as used herein means separated "by the hand of man" from its natural state; i.e., that, if it occurs 
in nature, it has been changed or removed from its original environment, or both. For example, a 
naturally occurring nucleic acid molecule or a polypeptide naturally present in a living organism in its 
natural state is not "isolated," but the same nucleic acid molecule or polypeptide separated from the 
coexisting materials of its natural state is "isolated", as the term is employed herein. As part of or 
following isolation, such nucleic acid molecules can be joined to other nucleic acid molecules, such as 
DNAs, for mutagenesis, to form fusion proteins, and for propagation or expression in a host, for instance. 
The isolated nucleic acid molecules, alone or joined to other nucleic acid molecules such as vectors, can be 
introduced into host cells, in culture or in whole organisms. Introduced into host cells in culture or in 
whole organisms, such DNAs still would be isolated, as the term is used herein, because they would not 
be in their naturally occurring form or environment. Similarly, the nucleic acid molecules and 
polypeptides may occur in a composition, such as a media formulations, solutions for introduction of 
nucleic acid molecules or polypeptides, for example, into cells, compositions or solutions for chemical or 
enzymatic reactions, for instance, which are not naturally occurring compositions, and, therein remain 
isolated nucleic acid molecules or polypeptides within the meaning of that term as it is employed herein. 

The nucleic acids according to the present invention may be chemically synthesized. Alternatively, the 
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nucleic acids can be isolated from S. agalactiae by methods known to the one skilled in the art. 

According to another aspect of the present invention, a comprehensive set of novel hyperimmune serum 
reactive antigens and fragments thereof are provided by using the herein described antigen identification 
method. In a preferred embodiment of the invention, a hyperimmune serum-reactive antigen comprising 
an amino acid sequence being encoded by any one of the nucleic acids molecules herein described and 
fragments thereof are provided. In another preferred embodiment of the invention a novel set of 
hyperimmune serum-reactive antigens which comprises amino acid sequences selected from a group 
consisting of the polypeptide sequences as represented in Seq ID No 231, 307, 374-433 and fragments 
thereof are provided. In a further preferred embodiment of the invention hyperimmune serum-reactive 
antigens which comprise amino acid sequences selected from a group consisting of the polypeptide 
sequences as represented in Seq ID No 218, 220, 222-230, 232, 235-242, 244-248, 250-253, 256-285, 287-302, 
309-317, 320-343, 345-362, 364, 366-373, 434, 449-462 and 475-486 and fragments thereof are provided. In a 
still preferred embodiment of the invention hyperimmune serum-reactive antigens which comprise 
amino acid sequences selected from a group consisting of the polypeptide sequences as represented in 
Seq ID No 249, 303, 308, 318, 344 and fragments thereof are provided. 

The hyperimmune serum reactive antigens and fragments thereof as provided in the invention include 
any polypeptide set forth in the Sequence Listing as well as polypeptides which have at least 70% identity 
to a polypeptide set forth in the Sequence Listing, preferably at least 80% or 85% identity to a polypeptide 
set forth in the Sequence Listing, and more preferably at least 90% similarity (more preferably at least 
90% identity) to a polypeptide set forth in the Sequence Listing and still more preferably at least 95%, 
96%, 97%, 98%, 99% or 99.5% similarity (still more preferably at least 95%, 96%, 97%, 98%, 99%, or 99.5% 
identity) to a polypeptide set forth in the Sequence Listing and also include portions of such polypeptides 
with such portion of the polypeptide generally containing at least 4 amino acids and more preferably at 
least 8, still more preferably at least 30, still more preferably at least 50 amino acids, such as 4, 8, 10, 20, 
30, 35, 40, 45 or 50 amino acids. 

The invention also relates to fragments, analogs, and derivatives of these hyperimmune serum reactive 
antigens and fragments thereof. The terms "fragment", "derivative" and "analog" when referring to an 
antigen whose amino acid sequence is set forth in the Sequence Listing, means a polypeptide which 
retains essentially the same or a similar biological function or activity as such hyperimmune serum 
reactive antigen and fragment thereof. 

The fragment, derivative or analog of a hyperimmune serum reactive antigen and fragment thereof may 
be 1) one in which one or more of the amino acid residues are substituted with a conserved or non- 
conserved amino acid residue (preferably a conserved amino acid residue) and such substituted amino 
acid residue may or may not be one encoded by the genetic code, or 2) one in which one or more of the 
amino acid residues includes a substituent group, or 3) one in which the mature hyperimmune serum 
reactive antigen or fragment thereof is fused with another compound, such as a compound to increase the 
half-life of the hyperimmune serum reactive antigen and fragment thereof (for example, polyethylene 
glycol), or 4) one in which the additional amino acids are fused to the mature hyperimmune serum 
reactive antigen or fragment thereof, such as a leader or secretory sequence or a sequence which is 
employed for purification of the mature hyperimmune serum reactive antigen or fragment thereof or a 
proprotein sequence. Such fragments, derivatives and analogs are deemed to be within the scope of 
those skilled in the art from the teachings herein. 

The present invention also relates to antigens of different S. agalactiae isolates. Such homologues 
may easily be isolated based on the nucleic acid and amino acid sequences disclosed herein. There are 9 
serotypes distinguished to date and the typing is based on serotype specific antisera. The presence of any 
antigen can accordingly be determined for every serotype. In addition it is possible to determine the 
variability of a particular antigen in the various serotypes as described for the S. pyogenes sic gene {Hoe, 
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N. et al., 2001}. The contribution of the various serotypes to the different GBS infections varies in the 
different age groups and geographical regions. It is an important aspect that the most valuable protective 
antigens are expected to be conserved among various clinical strains. 

Among the particularly preferred embodiments of the invention in this regard are the hyperimmune 
serum reactive antigens set forth in the Sequence Listing, variants, analogs, derivatives and fragments 
thereof, and variants, analogs and derivatives of fragments. Additionally, fusion polypeptides 
comprising such hyperimmune serum reactive antigens, variants, analogs, derivatives and fragments 
thereof, and variants, analogs and derivatives of the fragments are also encompassed by the present 
invention. Such fusion polypeptides and proteins, as well as nucleic acid molecules encoding them, can 
readily be made using standard techniques, including standard recombinant techniques for producing 
and expression of a recombinant polynucleic acid encoding a fusion protein. 

Among preferred variants are those that vary from a reference by conservative amino acid substitutions. 
Such substitutions are those that substitute a given amino acid in a polypeptide by another amino acid of 
like characteristics. Typically seen as conservative substitutions are the replacements, one for another, 
among the aliphatic amino acids Ala, Val, Leu and He; interchange of the hydroxyl residues Ser and Thr, 
exchange of the acidic residues Asp and Glu, substitution between the amide residues Asn and Gin, 
exchange of the basic residues Lys and Arg and replacements among the aromatic residues Phe and Tyr. 

Further particularly preferred in this regard are variants, analogs, derivatives and fragments, and 
variants, analogs and derivatives of the fragments, having the amino acid sequence of any polypeptide 
set forth in the Sequence Listing, in which several, a few, 5 to 10, 1 to 5, 1 to 3, 2, 1 or no amino acid 
residues are substituted, deleted or added, in any combination. Especially preferred among these are 
silent substitutions, additions and deletions, which do not alter the properties and activities of the 
polypeptide of the present invention. Also especially preferred in this regard are conservative 
substitutions. Most highly preferred are polypeptides having an amino acid sequence set forth in the 
Sequence Listing without substitutions. 

The hyperimmune serum reactive antigens and fragments thereof of the present invention are preferably 
provided in an isolated form, and preferably are purified to homogeneity. 

Also among preferred embodiments of the present invention are polypeptides comprising fragments of 
the polypeptides having the amino acid sequence set forth in the Sequence Listing, and fragments of 
variants and derivatives of the polypeptides set forth in the Sequence Listing. 

In this regard a fragment is a polypeptide having an amino acid sequence that entirely is the same as part 
but not all of the amino acid sequence of the afore mentioned hyperimmune serum reactive antigen and 
fragment thereof, and variants or derivative, analogs, fragments thereof. Such fragments may be "free- 
standing", i.e., not part of or fused to other amino acids or polypeptides, or they may be comprised 
within a larger polypeptide of which they form a part or region. Also preferred in this aspect of the 
invention are fragments characterised by structural or functional attributes of the polypeptide of the 
present invention, i.e. fragments that comprise alpha-helix and alpha-helix forming regions, beta-sheet 
and beta-sheet forming regions, turn and turn-forming regions, coil and coil-forming regions, hydrophilic 
regions, hydrophobic regions, alpha amphipathic regions, beta-amphipathic regions, flexible regions, 
surface-forming regions, substrate binding regions, and high antigenic index regions of the polypeptide 
of the present invention, and combinations of such fragments. Preferred regions are those that mediate 
activities of the hyperimmune serum reactive antigens and fragments thereof of the present invention. 
Most highly preferred in this regard are fragments that have a chemical, biological or other activity of the 
hyperimmune serum reactive antigen and fragments thereof of the present invention, including those 
with a similar activity or an improved activity, or with a decreased undesirable activity. Particularly 
preferred are fragments comprising receptors or domains of enzymes that confer a function essential for 
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viability of S. agalactiae or the ability to cause disease in humans. Further preferred polypeptide 
fragments are those that comprise or contain antigenic or immunogenic determinants in an animal, 
especially in a human. 

An antigenic fragment is defined as a fragment of the identified antigen, which is for itself antigenic or 
may be made antigenic when provided as a hapten. Therefore, also antigens or antigenic fragments 
showing one or (for longer fragments) only a few amino acid exchanges are enabled with the present 
invention, provided that the antigenic capacities of such fragments with amino acid exchanges are not 
severely deteriorated on the exchange(s), i.e., suited for eliciting an appropriate immune response in an 
individual vaccinated with this antigen and identified by individual antibody preparations from 
individual sera. 

Preferred examples of such fragments of hyperimmune serum-reactive antigens selected from the group 
consisting of peptides comprising amino acid sequences of column "predicted immunogenic aa" and 
"location of identified immunogenic region" of Table 1A, especially peptides comprising amino acid 4- 
20, 35-44, 65-70, 73-87, 92-98, 112-137, 152-161, 177-186, 193-200, 206-213, 229-255, 282-294, 308-313, 320- 
326, 349-355, 373-384, 388-406, 420-425 and 115-199 of Seq ID No 218; 5-24, 35-41, 44-70, 73-89, 103-109, 
127-143, 155-161, 185-190, 192-207, 212-219, 246-262, 304-336, 372-382, 384-393, 398-407, 412-418, 438-444, 
1-75, 76-161 and 164-239 of Seq ID No 219; 4-10, 16-58, 60-71, 77-92, 100-126, 132-146, 149-164, 166-172, 
190-209, 214-220, 223-229, 241-256, 297-312, 314-319, 337-343, 351-359, 378-387, 398-418, 421-428, 430-437, 
440-448' 462-471, 510-519, 525-536, 552-559, 561-568, 573-582, 596-602, 608-630, 637-649, 651-665, 681-702, 
714-732, 739-745, 757-778, 790-805, 807-815, 821-829, 836-842, 846-873, 880-903, 908-914, 916-923, 931-940, 
943-948, 956-970, 975-986, 996-1015, 1031-1040, 1051-1069, 1072-1095, 1114-1119, 1130-1148, 1150-1157, 
1169-1176, 1229-1238 and 802-812 of Seq ID No 220; 5-12, 14-26, 35-47, 52-67, 72-78, 83-98, 121-141, 152- 
159, 163-183, 186-207, 209-257, 264-277, 282-299, 301-309, 312-318, 324-339, 358-368, 372-378, 387-397, 425- 
43l'and 46-291 of Seq ID No 221; 29-38, 44-64, 70-76, 78-87, 94-100, 102-112, 119-134, 140-149, 163-173, 
178-186, 188-194, 207-234, 247-262, 269-290 and 73-92 of Seq ID No 222; 10-28, 36-63, 77-87, 103-119, 127- 
136, 141-169, 171-183, 195-200, 207-232, 236-246, 251-265, 268-283, 287-297, 314-322, 335-343, 354-363, 384- 
390, 405-411, 419-436, 443-455, 467-473, 480-513, 518-529, 550-557, 565-585, 602-608, 616-625, 632-660, 665- 
677' 685-701, 726-736, 738-747, 752-761, 785-796, 801-813, 838-853, 866-871 and 757-774 of Seq ID No 223; 
31-38, 61-66, 74-81, 90-115, 123-145, 154-167, 169-179, 182-193, 200-206, 238-244, 267-272 and 235-251 of 
Seq ID No 224; 19-25, 38-54, 56-64, 66-72, 74-92, 94-100, 116-129, 143-149, 156-183, 204-232, 253-266, 269- 
275, 294-307 and 241-313 of Seq ID No 225; 5-34, 50-56, 60-65, 74-85, 89-97, 108-119, 159-165, 181-199, 209- 
225, 230-240, 245-251, 257-262, 274-282, 300-305 and 64-75 of Seq ID No 226; 5-13, 16-21, 27-42, 45-52, 58- 
66, 74-87, 108-114, 119-131 and 39-51 of Seq ID No 227; 6-23, 46-54, 59-65, 78-84, 100-120, 128-133, 140-146, 
159-165, 171-183, 190-204, 224-232, 240-248, 250-259, 274-280, 288-296, 306-315 and 267-274 of Seq ID No 
228; 5-12, 15-24, 26-36, 42-65, 68-80, 82-104, 111-116, 125-144, 159-167, 184-189, 209-218, 235-243, 254-265, 
269-283, 287-300, 306-316, 318-336, 338-352, 374-392 and 162-174 of Seq ID No 229; 30-42, 45-54 and 25-37 
of Seq ID No 230; 10-30, 53-59, 86-95, 116-130, 132-147, 169-189, 195-201, 212-221, 247-256, 258-265, 278- 
283, 291-298, 310-316, 329-339, 341-352, 360-367, 388-396, 398-411, 416-432, 443-452, 460-466, 506-512, 515- 
52l', 542-548 and 419-431 of Seq ID No 231; 4-27, 30-53, 60-67, 70-90, 92-151, 159-185, 189-195, 198-210, 
215-239 and 173-189 of Seq ID No 232; 4-26, 41-54, 71-78, 116-127, 140-149, 151-158, 161-175, 190-196, 201- 
208, 220-226, 240-252, 266-281, 298-305, 308-318, 321-329, 344-353, 372-378, 384-405, 418-426, 429-442, 457- 
463, 494-505, 514-522 and 174-188 of Seq ID No 233; 17-25, 27-39, 61-67, 81-89, 99-110, 120-131, 133-139, 
147-161, 167-172, 179-185, 192-198, 203-213, 226-238, 243-258, 261-267, 284-290, 296-307, 311-328, 340-352, 
356-371 and 239-256 of Seq ID No 234; 8-30, 40-49, 67-80, 114-123, 126-142, 152-162, 188-194 and 57-70 of 
Seq ID No 235; 4-23, 28-34, 36-47, 50-61, 76-81, 89-94, 96-104, 112-119, 126-146, 155-181, 195-200, 208-214, 
220-229, 244-260, 263-276, 282-288, 292-300, 317-323, 336-351, 353-359, 363-375, 382-399, 415-432, 444-455, 
458-471, 476-481, 484-492, 499-517, 522-529, 535-541, 543-568, 572-584, 586-600, 607-617, 626-637, 656-675 
and 282-297 of Seq ID No 236; 6-24, 30-35, 38-45, 63-91, 134-140, 146-160, 167-188, 214-220, 226-234, 244- 
250, 260-270, 286-301, 316-329, 340-371, 429-446, 448-459, 474-481, 485-491, 512-526, 537-544, 550-565, 573- 
583, 596-613, 621-630, 652-658 and 87-97 of Seq ID No 237; 8-20, 26-48, 56-67, 76-86, 94-109, 115-121, 123- 
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129 143-160 178-186, 191-198, 201-208, 221-236, 238-244, 260-268 and 237-247 of Seq ID No 238; 4-40, 42- 
57, 73-87, 98-117, 126-135, 150-156, 166-174, 196-217, 231-236, 248-258, 276-284, 293-301, 307-313, 339-347, 
359-365 375-387, 395-402, 428-440, 445-456, 485-490, 497-505, 535-541, 547-555, 610-625, 648-656, 665-671 
and 448-528 of Seq ID No 239; 10-18, 39-45, 51-61, 80-96, 98-106, 110-115, 158-172, 174-183, 191-200, 220- 
237, 249-255, 274-289, 308-324, 331-341, 372-381, 384-397, 405-414 and 322-338 of Seq ID No 240; 30-36, 38- 
^6 85-108 134-147, 149-160, 163-183, 188-201, 206-211, 219-238, 247-254 and 5-13 of Seq ID No 241; 11-40, 
98-103, 110-115, 133-145, 151-159, 172-179, 192-201, 204-212, 222-228, 235-245, 258-268, 283-296, 298-309, 
322-329 342-351, 354-362, 372-378, 385-393, 407-418, 495-516 and 1-148 of Seq ID No 242; 5-19, 21-36, 73- 

94 112-119, 122-137, 139-145, 152-167, 184-190, 198-204, 208-224, 249-265, 267-281, 299-304, 309-317, 326- 
333, 356-364, 368-374, 381-389, 391-414, 419-425, 430-435 and 113-140 of Seq ID No 243; 45-54, 59-67, 78-91 
and 15-23 of Seq ID No 244; 11-22, 33-47, 52-80, 88-112, 124-129 and 6-25 of Seq ID No 245; 26-41, 51-63, 
80-89 93-115, 150-163, 187-193, 220-237, 240-249, 286-294, 296-306, 316-329, 345-353, 361-370, 407-425, 428- 
437 474-482 484-494, 504-517, 533-541, 549-558, 595-613, 616-625, 660-668, 673-685, 711-726, 736-744, 749- 
761, 787-802, 812-820, 825-837, 863-878, 888-896, 901-913, 939-954, 964-972, 977-989, 1003-1008, 1016-1022, 
1028-1034, 1041-1053, 1059-1074, 1101-1122, 420-511 and 581-704 of Seq ID No 246; 18-25, 27-55, 71-83, 89- 

95 102-113, 120-146, 150-156, 174-185 and 159-175 of Seq ID No 247; 24-30, 38-56, 63-68, 87-93, 136-142, 
153-164, 183-199, 213-219, 226-234, 244-261, 269-278, 283-289, 291-297, 320-328, 330-336, 340-346, 348-356, 
358-366^ 382-387, 401-408, 414-419, 449-455, 468-491, 504-512, 531-537, 554-560, 597-608, 621-627, 632-643, 
650-662, 667-692, 703-716, 724-737, 743-758, 783-794, 800-818, 846-856 and 806-884 of Seq ID No 248; 4-14, 
21-39 86-92, 99-107, 121-131, 136-144, 147-154, 158-166, 176-185, 193-199, 207-222, 224-230 and 117-136 of 
Seq ID No 249; 65-76, 85-97, 103-109, 115-121, 125-146, 163-169, 196-205, 212-219, 228-237, 241-247, 254- 
262 269-288 294-303, 305-313, 328-367, 395-401, 405-412, 418-429, 437-447, 481-488, 506-513, 519-524, 530- 
541^ 546-557 and 266-284 of Seq ID No 250; 5-14, 37-42, 49-71, 78-92, 97-112, 127-136, 147-154, 156-163, 
186-198 216-225, 233-243, 248-253, 295-307, 323-332, 359-366, 368-374, 380-398 and 194-223 of Seq ID No 
251; 4-11, 33-39, 45-72, 100-113, 119-129, 136-144, 169-175, 177-185, 200-208, 210-219, 262-276, 278-297, 320- 
326' 336-344, 347-362, 381-394, 443-453 and 438-454 of Seq ID No 252; 4-29, 31-52, 55-61, 95-110, 138-158, 

162- 171 179-187, 202-229, 239-248, 251-256, 262-267, 269-285, 304-310, 351-360, 362-368, 381-388, 415-428, 
435-440', 448-458 and 161-178 of Seq ID No 253; 4-17, 19-28, 32-43, 47-59, 89-110, 112-126, 128-134, 140-148, 
152-161, 169-184, 191-204, 230-235, 255-264, 328-338, 341-347, 401-409, 413-419, 433-441, 449-458, 463-468, 
476-482, 486-492, 500-506, 529-545 and 305-381 of Seq ID No 254; 10-29, 38-45, 53-61, 134-145, 152-160, 

163- 170, 202-208, 219-229, 248-258, 266-275, 282-288, 315-320, 328-334, 377-385, 392-402, 418-424, 447-453, 
460-471, 479-487, 491-497, 500-507, 531-537, 581-594, 615-623, 629-635, 644-652, 659-666, 668-678, 710-717, 
719-728, 736-741, 747-760, 766-773, 784-789, 794-800, 805-817, 855-861, 866-887 and 698-715 of Seq ID No 
255- 16-26 29-37, 44-58, 62-68, 74-80, 88-95, 97-120, 125-144, 165-196 and 58-72 of Seq ID No 256; 14-21, 23- 
46, 49-60, 63-74, 78-92, 96-103, 117-129, 134-161, 169-211, 217-231, 239-248, 252-281, 292-299, 313-343 and 
243-257 of Seq ID No 257; 11-27, 46-52, 67-72, 76-84, 91-112, 116-153, 160-175, 187-196, 202-211, 213-220 
and 43-76 of Seq ID No 258; 5-29, 37-56, 78-86, 108-118, 152-161 and 120-130 of Seq ID No 259; 8-14, 19- 
41, 52-66, 75-82, 87-92, 106-121, 127-133, 136-143, 158-175, 180-187, 196-204, 221-228, 239-245, 259-265, 291- 
306, 318-323, 328-340, 352-358, 361-368, 375-381, 391-399, 411-418, 431-442, 446-455, 484-496, 498-510, 527- 
533, 541-549, 558-565, 575-585, 587-594, 644-655, 661-668, 671-677 and 184-196 of Seq ID No 260; 4-22, 29- 
38, 55-62, 75-81, 102-107, 110-134, 143-150, 161-167, 172-179, 191-215, 223-233, 241-247, 251-264, 266-272, 
288-309, 340-352, 354-366, 394-402, 414-438 and 198-218 of Seq ID No 261; 24-44, 49-70, 80-91, 105-118, 
128-136 140-154 and 77-92 of Seq ID No 262; 5-22, 31-36, 41-47, 67-74, 83-90, 105-122, 135-143, 160-167 and 
118-129 of Seq ID No 263; 4-25, 33-73, 81-93, 96-106, 114-120, 122-128, 130-172, 179-208, 210-241, 251-283, 
296-301 and 92-100 of Seq ID No 264; 14-24, 29-38, 43-50, 52-72, 86-97, 101-107, 110-125, 127-141, 145-157, 
168-175, 177-184, 186-195, 205-226, 238-250, 255-261, 284-290, 293-304, 307-314, 316-323, 325-356, 363-371, 
383-390, 405-415, 423-432, 442-454, 466-485, 502-511, 519-527, 535-556, 558-565, 569-574, 612-634, 641-655, 
672-686, 698-709, 715-722, 724-732, 743-753, 760-769, 783-792, 818-825, 830-839, 842-849, 884-896, 905-918, 
926-940, 957-969, 979-1007, 1015-1021, 1049-1057 and 336-349 of Seq ID No 265; 6-16, 26-31, 33-39, 62-73, 
75-85, 87-100, 113-123, 127-152, 157-164, 168-181, 191-198, 208-214, 219-226, 233-254, 259-266, 286-329 and 
181-195 of Seq ID No 266; 4-13, 32-39, 53-76, 99-108, 110-116, 124-135, 137-146, 149-157, 162-174, 182-190, 
207-231, 242-253, 255-264, 274-283, 291-323, 334-345, 351-360, 375-388, 418-425, 456-474, 486-492, 508-517, 
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520-536, 547-560, 562-577, 31-45 and 419-443 of Seq ID No 267; 15-26, 30-37, 42-49, 58-90, 93-99, 128-134, 
147-154, 174-179, 190-197, 199-205, 221-230, 262-274, 277-287, 300-314, 327-333, 343-351, 359-377, 388-396, 
408-413, 416-425, 431-446 and 246-256 of Seq ID No 268; 5-26, 34-42, 47-54, 61-67, 71-104, 107-115, 131-138, 
144-153, 157-189, 196-202, 204-210, 228-245, 288-309, 316-329, 332-341, 379-386, 393-399, 404-412, 414-421, 
457-468, 483-489, 500-506, 508-517, 523-534, 543-557, 565-580, 587-605, 609-617, 619-627, 631-636, 640-646, 
662-668^ 675-682, 705-710, 716-723, 727-732, 750-758, 784-789, 795-809, 869-874, 14-138, 166-286, 372-503, 
674-696 and 754-859 of Seq ID No 269; 5-17, 32-38, 40-47, 80-89, 113-119, 125-137, 140-154, 157-163, 170- 
177, 185-199, 213-225, 228-236, 242-248, 277-290, 292-305, 323-333, 347-353, 364-370, 385-394, 399-406, 423- 
433, 441-451, 462-474, 477-487 and 116-124 of Seq ID No 270; 7-16, 18-30, 32-49, 53-61, 63-85, 95-101, 105- 
115, 119-134, 143-150, 159-178, 185-202, 212-229, 236-250, 254-265, 268-294 and 63-72 of Seq ID No 271; 4- 
12, 19-47, 73-81, 97-103, 153-169, 188-198, 207-213, 217-223, 236-242, 255-265, 270-278, 298-305, 309-317, 
335-347, 354-363, 373-394, 419-424, 442-465, 486-492, 500-507, 542-549, 551-558, 560-572, 580-589, 607-614, 
617-623, 647-653, 666-676, 694-704, 706-714, 748-754, 765-772, 786-792, 795-806 and 358-370 of Seq ID No 
272; 18-28, 30-38, 40-46, 49-55, 69-78, 82-98, 104-134, 147-153, 180-190, 196-202, 218-236, 244-261, 266-273, 
275-286, 290-295, 301-314, 378-387, 390-395, 427-434 and 290-305 of Seq ID No 273; 4-13, 20-31, 39-51, 54- 
61, 69-84, 87-105, 117-124 and 108-125 of Seq ID No 274; 24-34, 43-54, 56-66, 68-79 and 50-69 of Seq ID No 
275; 5-43, 71-77, 102-131, 141-148, 150-156, 159-186, 191-207, 209-234, 255-268, 280-286, 293-299, 317-323, 
350-357, 363-372, 391-397, 406-418, 428-435, 455-465, 484-497, 499-505, 525-531, 575-582, 593-607, 621-633, 
638-649, 655-673, 684-698, 711-725, 736-741, 743-752, 759-769, 781-793, 813-831, 843-853, 894-905, 908-916, 
929-946, 953-963, 970-978, 1001-1007, 1011-1033, 165-178 and 818-974 of Seq ID No 276; 16-44, 63-86, 98- 
108, 185-191, 222-237, 261-274, 282-294, 335-345, 349-362, 374-384, 409-420, 424-430, 440-447, 453-460, 465- 
473, 475-504, 522-534, 538-551, 554-560, 567-582, 598-607, 611-619, 627-640, 643-653, 655-661, 669-680, 684- 
690^ 701-707, 715-731, 744-750, 756-763, 768-804, 829-837, 845-853, 855-879, 884-890, 910-928, 77-90, 144-212, 
279-355, 434-536, 782-810 and 875-902 of Seq ID No 277; 4-22, 29-41, 45-51, 53-66, 70-77, 86-95, 98-104, 106- 
124, 129-135, 142-151, 153-161, 169-176, 228-251, 284-299, 331-337, 339-370, 380-387, 393-398, 406-411, 423- 
433' 440-452, 461-469, 488-498, 501-516, 523-530, 532-559, 562-567, 570-602, 612-628, 630-645, 649-659, 666- 
672, 677-696, 714-723, 727-747 and 212-227 of Seq ID No 278; 4-9, 17-31, 35-41, 56-61, 66-75, 81-87, 90-124, 
133-138, 149-163, 173-192, 213-219, 221-262, 265-275, 277-282, 292-298, 301-307, 333-346, 353-363, 371-378, 
419-430, 435-448, 456-469, 551-570, 583-599, 603-612 and 275-291 of Seq ID No 279; 28-34, 53-58, 72-81, 
100-128, 145-154, 159-168, 172-189, 217-225, 227-249, 256-263, 299-309, 322-330, 361-379, 381-388, 392-401, 
404-417, 425-436, 440-446, 451-464, 469-487, 502-511, 543-551, 559-564, 595-601, 606-612, 615-626, 633-642, 
644-650, 664-670, 674-684, 692-701, 715-723, 726-734, 749-756, 763-771, 781-787, 810-843, 860-869, 882-889, 
907-917, 931-936, 941-948, 951-958, 964-971, 976-993, 1039-1049, 1051-1065, 1092-1121, 1126-1132, 1145- 
1151, 1158-1173, 1181-1192, 1194-1208, 1218-1223, 1229-1243, 1249-1254, 1265-1279, 1287-1297, 1303-1320, 
1334-1341, 1343-1358, 1372-1382, 1406-1417, 1419-1425, 1428-1434, 1441-1448, 1460-1473, 1494-1504, 1509- 
1514, 1529-1550, 654-669 and 1400-1483 of Seq ID No 280; 10-16, 20-25, 58-65, 97-109, 118-132, 134-146, 
148-155, 186-195, 226-233, 244-262, 275-284, 295-310, 317-322, 330-339, 345-351, 366-375, 392-403, 408-415, 

423- 430, 435-444, 446-457, 467-479, 486-499, 503-510, 525-537, 540-585, 602-612, 614-623, 625-634, 639-645, 
650-669, 700-707, 717-724, 727-739, 205-230 and 733-754 of Seq ID No 281; 5-22, 37-43, 72-81, 105-113, 128- 
133, 148-160, 188-194, 204-230, 238-245, 251-257 and 194-213 of Seq ID No 282; 16-21, 35-41, 56-72, 74-92, 
103-109 and 62-68 of Seq ID No 283; 4-15, 17-82, 90-104, 107-159, 163-170, 188-221, 234-245, 252-265 and 
220-235 of Seq ID No 284; 16-22, 36-46, 61-75, 92-107, 113-121, 139-145, 148-160 and 30-42 of Seq ID No 
285; 4-12, 20-26, 43-49, 55-62, 66-78, 121-127, 135-141, 146-161, 164-170, 178-189, 196-205, 233-238, 269-279, 
288-318, 325-332, 381-386, 400-407 and 328-346 of Seq ID No 286; 5-12, 31-49, 57-63, 69-79, 89-97, 99-114, 
116-127, 134-142, 147-154, 160-173, 185-193, 199-204, 211-222, 229-236, 243-249, 256-274 and 58-68 of Seq 
ID No 287; 10-20, 28-34, 39-53, 68-79, 84-90, 99-106 and 73-79 of Seq ID No 288; 14-37, 45-50, 61-66, 77-82, 
93-98, 109-114, 125-130, 141-146, 157-162, 173-178, 189-194, 205-210, 221-226, 237-242, 253-258, 269-274, 
285-290, 301-306, 316-332, 349-359, 371-378, 385-406, 34-307 and 312-385 of Seq ID No 289; 4-10, 17-38, 50- 
85, 93-99, 109-116, 128-185, 189-197, 199-210, 223-256, 263-287, 289-312, 327-337, 371-386, 389-394, 406-419, 

424- 432, 438-450, 458-463, 475-502, 507-513, 519-526, 535-542, 550-567 and 361-376 of Seq ID No 290; 10-39, 
42-93, 100-144, 155-176, 178-224, 230-244, 246-255, 273-282, 292-301, 308-325, 332-351, 356-361, 368-379, 
386-393, 400-421 and 138-155 of Seq ID No 291; 5-11, 17-34, 40-45, 50-55, 72-80, 101-123, 145-151, 164-172, 
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182-187, 189-195, 208-218, 220-241, 243-252, 255-270, 325-331, 365-371, 391-398, 402-418, 422-428, 430-435, 
443-452, 463-469, 476-484, 486-494, 503-509, 529-553, 560-565, 570-590, 608-614, 619-627, 654-661, 744-750, 
772-780, 784-790, 806-816, 836-853, 876-885, 912-918, 926-933, 961-975, 980-987, 996-1006, 1016-1028, 1043- 
1053, 1057-1062, 994-1003 and 1033-1056 of Seq ID No 292; 17-45, 64-71, 73-81, 99-109, 186-192, 223-238, 
262-275, 283-295, 336-346, 350-363, 375-385, 410-421, 425-431, 441-448, 454-463, 468-474, 476-512, 523-537, 
539-552, 568-583, 599-608, 612-620, 628-641, 644-654, 656-662, 670-681, 685-695, 702-708, 716-723, 725-735, 
757-764 769-798, 800-806, 808-816, 826-840, 846-854, 856-862, 874-881, 885-902, 907-928, 274-350 and 443- 
513 of Seq ID No 293; 4-22, 29-41, 45-51, 53-61, 70-76, 85-92, 99-104, 111-122, 134-140, 142-154, 163-174, 
224-232, 255-265, 273-279, 283-297, 330-335, 337-348, 356-367, 373-385, 391-396, 421-431, 442-455, 475-485, 
493-505 526-538, 544-561, 587-599, 605-620, 622-651, 662-670, 675-681, 687-692, 697-712, 714-735 and 252- 
262 of Seq ID No 294; 4-12, 15-35, 40-46, 50-59, 67-94, 110-128, 143-169, 182-188, 207-215, 218-228, 238-250 
and 74-90 of Seq ID No 295; 9-18, 42-58, 78-85, 88-95, 97-106, 115-122, 128-134, 140-145, 154-181, 186-202, 
204-223, 261-267, 269-278, 284-293, 300-336, 358-368 and 12-29 of Seq ID No 296; 7-34, 46-53, 62-72, 82-88, 
100-105 111-117, 132-137, 144-160, 166-180, 183-189, 209-221, 231-236, 246-253, 268-282, 286-293, 323-336, 
364-372, 378-392, 422-433 and 388-405 of Seq ID No 297; 21-27, 34-50, 72-77, 80-95, 164-177, 192-198, 202- 
220, 226-236, 239-247, 270-279, 285-292, 315-320, 327-334, 348-355, 364-371, 388-397, 453-476, 488-497, 534- 
545, 556-576, 582-588, 601-607, 609-616, 642-662, 674-681, 687-697, 709-715, 721-727, 741-755 and 621-739 of 
Seq ID No 298; 4-14, 16-77, 79-109 and 25-99 of Seq ID No 299; 4-9, 17-23, 30-37, 44-55, 65-72, 77-93, 102- 
121, 123-132, 146-153 and 17-29 of Seq ID No 300; 4-18, 25-41, 52-60, 83-92, 104-112, 117-123, 149-155, 159- 
167, 170-192, 201-210, 220-227, 245-250 and 124-137 of Seq ID No 301; 8-25, 50-55, 89-95, 138-143, 148-153, 
159-169, 173-179, 223-238, 262-268, 288-295, 297-308, 325-335, 403-409, 411-417, 432-446, 463-475, 492-501, 
524-530, 542-548, 561-574, 576-593, 604-609, 612-622, 637-654, 665-672, 678-685, 720-725, 731-739, 762-767, 
777-783, 820-838, 851-865, 901-908, 913-920, 958-970, 1000-1006, 1009-1015, 1020-1026, 1043-1052, 1055- 
1061, 1-128, 252-341, 771-793 and 1043-1058 of Seq ID No 302; 16-26, 33-46 and 64-76 of Seq ID No 303; 4- 
27, 69-77, 79-101, 117-123, 126-142, 155-161, 171-186, 200-206, 213-231, 233-244, 267-273, 313-329, 335-344, 
347-370, 374-379, 399-408, 422-443, 445-453, 461-468, 476-482, 518-534, 544-553, 556-567, 578-595, 601-620, 
626-636, 646-658, 666-681, 715-721, 762-768, 778-785, 789-803, 809-819, 22-108, 153-318, 391-527 and 638-757 
of Seq ID No 304; 6-21, 32-43, 62-92, 104-123, 135-141, 145-152, 199-216, 218-226, 237-247, 260-269, 274-283, 
297-303, 1-72 and 127-211 of Seq ID No 305; 6-26, 50-56, 83-89, 108-114, 123-131, 172-181, 194-200, 221-238, 
241-247, 251-259, 263-271, 284-292, 304-319, 321-335, 353-358, 384-391, 408-417, 424-430, 442-448, 459-466, 
487-500, 514-528, 541-556, 572-578, 595-601, 605-613, 620-631, 635-648, 660-670, 673-679, 686-693, 702-708, 
716-725, 730-735, 749-755, 770-777, 805-811, 831-837, 843-851, 854-860, 863-869, 895-901, 904-914, 922-929, 
933-938, 947-952, 956-963, 1000-1005, 1008-1014, 1021-1030, 1097-1103, 1120-1130, 1132-1140, 1-213, 269-592 
and 992-1120 of Seq ID No 306; 9-16, 33-39, 47-59, 65-79, 81-95, 103-108, 115-123, 138-148, 163-171, 176- 
185, 191-196, 205-211, 213-221, 224-256, 261-276, 294-302, 357-363, 384-390, 95-111 and 161-189 of Seq ID 
No 307; 21-27, 35-45, 70-76, 92-105, 129-143, 145-155, 161-166, 170-191, 204-211, 214-231, 234-246, 249-255, 
259-275 and 1-18 of Seq ID No 308; 21-35, 45-53, 56-64, 69-97 and 1-16 of Seq ID No 309; 25-33, 41-47, 61- 
68, 86-101, 106-114, 116-129, 134-142, 144-156, 163-176, 181-190, 228-251, 255-261, 276-292, 295-305, 334-357, 
368-380, 395-410, 424-429, 454-460, 469-482, 510-516, 518-527, 531-546, 558-570, 579-606, 628-636, 638-645, 
651-656, 668-674, 691-698, 717-734, 742-754, 765-770, 792-797, 827-835, 847-859, 874-881, 903-909, 926-933, 
942-961, 964-977, 989-1004, 1010-1028, 1031-1047, 1057-1075, 1081-1095, 1108-1117, 1138-1144, 1182-1189, 
1193-1206, 1220-1229, 1239-1246, 1257-1267, 1271-1279, 1284-1301, 1312-1320, 1329-1335, 1341-1347, 1358- 
1371, 1399-1404, 1417-1426, 1458-1463, 1468-1476, 1478-1485, 1493-1506, 1535-1541, 1559-1574, 1583-1590, 
1595-1601, 1603-1611, 1622-1628, 1634-1644, 1671-1685, 1689-1696, 1715-1720, 1734-1746, 1766-1775, 1801- 
1806, 1838-1844, 1858-1871, 1910-1917, 1948-1955, 1960-1974, 2000-2015, 2019-2036, 2041-2063, 748-847 and 
1381-1391 of Seq ID No 310; 5-12, 18-24, 27-53, 56-63, 96-113, 119-124, 131-136, 157-163, 203-209, 215-223, 
233-246, 264-273, 311-316, 380-389, 393-399, 425-433, 445-450, 457-462, 464-470, 475-482, 507-513, 527-535, 
542-548, 550-565, 591-602, 607-613, 627-642, 644-664, 673-712, 714-732, 739-764, 769-782, 812-818, 826-838, 
848-854, 860-871, 892-906, 930-938, 940-954, 957-973, 990-998, 1002-1021, 1024-1033, 1037-1042, 1050-1060, 
1077-1083, 1085-1092, 1100-1129, 1144-1161, 1169-1175, 1178-1189, 1192-1198, 1201-1207, 1211-1221, 1229- 
1239, 1250-1270, 1278-1292, 1294-1300, 1314-1335, 1344-1352, 1360-1374, 1394-1405, 1407-1414, 1416-1424, 
1432-1452, 1456-1462, 1474-1497, 1500-1510, 1516-1522, 1534-1542, 1550-1559, 1584-1603, 1608-1627, 187- 
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273 and 306-441 of Seq ID No 311; 70-80, 90-97, 118-125, 128-140, 142-148, 154-162, 189-202, 214-222, 224- 
232, 254-260, 275-313, 317-332, 355-360, 392-398, 425-432, 448-456, 464-470, 476-482, 491-505, 521-528, 533- 
546, 560-567, 592-597, 605-614, 618-626, 637-644, 646-653, 660-666, 677-691 and 207-227 of Seq ID No 312; 
5-19, 26-34, 37-55, 57-66, 69-83, 86-102, 115-134, 138-143, 154-172, 178-195, 209-246, 251-257, 290-302, 306- 
311 and 256-266 of Seq ID No 313; 10-20, 22-28, 35-57, 72-79, 87-103, 108-128, 130-144, 158-171, 190-198, 
225-242, 274-291, 301-315, 317-324, 374-385 and 353-365 of Seq ID No 314; 4-9, 17-30, 34-54, 59-66, 73-94, 
118-130, 135-150, 158-171, 189-198, 219-239, 269-275, 283-301, 89-106 and 176-193 of Seq ID No 315; 14-20, 
22-74, 77-86, 89-99, 104-109, 126-135, 154-165, 181-195, 197-212, 216-224, 264-275 and 107-118 of Seq ID No 
316; 4-18, 21-38, 63-72, 101-109, 156-162, 165-179, 183-192, 195-210, 212-218, 230-239, 241-256, 278-290, 299- 
311, 313-322, 332-341, 348-366, 386-401, 420-426, 435-450, 455-460, 468-479, 491-498, 510-518, 532-538, 545- 
552, 557-563, 567-573, 586-595, 599-609, 620-626, 628-636, 652-657, 665-681 and 1-198 of Seq ID No 317; 4- 
10, 16-38, 51-68, 73-79, 94-115, 120-125, 132-178, 201-208, 216-223, 238-266, 269-295, 297-304, 337-342, 347- 
356, 374-401, 403-422, 440-447, 478-504, 510-516, 519-530, 537-544 and 191-206 of Seq ID No 318; 12-40, 42- 
48, 66-71, 77-86, 95-102, 113-120, 129-137, 141-148, 155-174, 208-214, 218-225, 234-240, 256-267, 275-283, 
300-306, 313-321, 343-350, 359-367, 370-383, 398-405, 432-439, 443-461, 492-508, 516-526, 528-535 and 370- 
478 of Seq ID No 319; 6-14, 20-37, 56-62, 90-95, 97-113, 118-125, 140-145, 161-170, 183-202, 237-244, 275- 
284, 286-305, 309-316, 333-359, 373-401, 405-412 and 176-187 of Seq ID No 320; 33-44, 50-55, 59-80, 86-101, 
129-139, 147-153, 157-163, 171-176, 189-201, 203-224, 239-245, 257-262, 281-287, 290-297, 304-320, 322-331, 
334-350, 372-390, 396-401, 71-88 and 353-372 of Seq ID No 321; 5-11, 15-24, 26-33, 40-47, 75-88, 95-103, 105- 
112 and 17-30 of Seq ID No 322; 5-11, 16-39, 46-54, 62-82, 100-107, 111-124, 126-150, 154-165, 167-183, 204- 
238, 245-295, 301-313, 316-335 and 8-16 of Seq ID No 323; 4-19, 34-48, 69-74, 79-107, 115-127, 129-135, 143- 
153, 160-169, 171-182 and 142-153 of Seq ID No 324; 4-30, 65-74, 82-106, 110-120, 124-132, 135-140, 146-175, 
179-184, 190-196, 217-223, 228-233, 250-267, 275-292, 303-315, 322-332 and 174-186 of Seq ID No 325; 9-16, 
29-41, 47-57, 68-84, 87-109, 113-119, 162-180, 186-193, 195-201, 203-208, 218-230, 234-243, 265-271, 281-292, 
305-312, 323-332, 341-347, 349-363, 368-374, 383-390, 396-410, 434-440, 446-452, 455-464, 466-473, 515-522, 
529-542, 565-570, 589-600, 602-613, 618-623, 637-644, 1019-1027, 1238-1244, 1258-1264, 1268-1276, 1281- 
1292, 1296-1302 and 883-936 of Seq ID No 326; 10-17, 23-32, 39-44, 54-72, 75-81, 88-111, 138-154, 160-167, 
178-185, 201-210, 236-252, 327-334, 336-342, 366-376, 388-400, 410-430, 472-482, 493-526, 552-558, 586-592, 
598-603, 612-621, 630-635, 641-660 and 384-393 of Seq ID No 327; 4-22, 24-39, 50-59, 73-84, 100-105, 111- 
117, 130-138, 155-161, 173-178, 182-189, 205-215, 266-284, 308-313, 321-328, 330-337, 346-363, 368-374, 388- 
395, 397-405, 426-434, 453-459, 482-492, 501-507, 509-515, 518-523, 527-544, 559-590, 598-612, 614-629, 646- 
659, 663-684, 686-694, 698-721 and 445-461 of Seq ID No 328; 14-22, 27-33 and 3-17 of Seq ID No 329; 29- 
41, 66-73, 81-87, 90-108, 140-146, 150-159, 165-184, 186-196, 216-226, 230-238, 247-253, 261-269 and 126-140 
of Seq ID No 330; 5-12, 16-25, 27-33, 36-45, 60-68, 83-88, 103-126 and 86-101 of Seq ID No 331; 14-23, 36- 
47, 56-66, 84-89, 94-105, 111-127, 140-153, 160-174, 176-183, 189-203, 219-225, 231-237, 250-257 and 194-227 
of Seq ID No 332; 4-25, 54-60, 64-71, 73-82, 89-106, 117-124, 157-169, 183-188, 199-210, 221-232, 236-244, 
255-264 and 58-98 of Seq ID No 333; 13-19, 26-36, 41-53, 55-71, 77-84, 86-108, 114-135, 157-172, 177-183, 
187-194, 208-213, 218-226, 110-125 and 156-170 of Seq ID No 334; 5-24, 63-69, 77-85, 94-112, 120-137, 140- 
146, 152-159, 166-172, 179-187, 193-199, 206-212, 222-228, 234-240, 244-252, 257-264, 270-289, 298-309, 316- 
328, 337-348, 363-375, 1-56 and 340-352 of Seq ID No 335; 18-39, 42-71, 78-120, 124-144, 152-173, 179-189, 
199-209, 213-222, 228-258, 269-304, 329-361, 364-372, 374-389, 396-441 and 313-327 of Seq ID No 336; 19-25, 
91-98, 108-120, 156-162, 168-174, 191-204, 211-216, 232-266, 272-278, 286-308, 316-321, 327-333, 344-355, 
358-364, 384-391, 395-428, 464-476, 487-495, 497-511, 544-561, 563-573, 575-582, 588-594, 10-25 and 322-338 
of Seq ID No 337; 14-26, 32-49, 51-57, 59-72, 80-91, 102-112, 119-125, 147-161, 164-173, 175-183, 188-213, 
217-222, 246-254, 260-276, 282-303, 308-318, 321-328, 333-350, 352-359, 371-378, 392-401, 407-414, 416-443, 
448-463, 471-484, 490-497, 501-514, 519-527, 539-551, 557-570, 578-590, 592-598, 600-610, 618-629, 633-647, 
654-667, 676-689, 702-709, 718-726, 728-737, 741-760, 764-780, 786-795, 808-826, 836-842, 845-852, 865-874, 
881-887, 931-945, 949-957, 968-974, 979-986, 1003-1009, 1023-1029 and 90-103 of Seq ID No 338; 11-16, 37- 
56, 60-66, 69-77, 80-88, 93-106, 117-139, 166-171 and 72-90 of Seq ID No 339; 59-84, 123-133, 145-150, 161- 
167, 178-189 and 115-128 of Seq ID No 340; 15-33, 39-46, 52-64, 74-87, 108-124, 127-144, 150-156, 173-179, 
184-194, 201-208, 219-236, 243-269, 272-295, 302-309, 343-349, 356-361, 370-379, 405-411, 414-423, 430-451, 
457-464, 466-475, 477-483, 496-502, 507-522, 541-548, 557-563, 571-577, 579-585, 590-605, 626-642, 650-662, 
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671-691, 704-710, 751-769, 775-781, 786-791, 794-829, 851-858, 868-878, 884-904, 913-919, 931-939 and 132- 
142 of Seq ID No 341; 33-58, 64-71, 74-80, 83-88, 96-120, 122-139, 146-157, 167-177, 207-213, 220-225, 236- 
242, 264-279, 300-305, 326-336, 340-347, 350-360, 97-115 and 199-211 of Seq ID No 342; 4-26, 43-57, 70-99, 
102-117, 121-133, 142-148, 151-168, 170-183, 192-220, 235-249, 258-279 and 30-41 of Seq ID No 343; 34-42, 
48-58, 70-94, 110-130, 154-160, 164-172, 178-183, 195-203, 211-222, 229-250, 256-261, 274-284, 286-292, 312- 
323 and 222-233 of Seq ID No 344; 4-9, 15-36, 38-45, 49-74, 78-88, 100-112, 136-191, 211-220, 226-233, 239- 
246, 254-274, 287-307, 316-322, 342-353, 356-366, 373-378, 384-393, 405-431, 449-457, 459-468, 487-511, 515- 
524, 529-541, 544-552, 562-568, 571-576 and 208-280 of Seq ID No 345; 10-27, 31-37, 39-54, 71-108, 124-143 
and 2-107 of Seq ID No 346; 16-27, 38-57, 64-70, 90-102, 104-113, 116-137, 160-166 and 1-80 of Seq ID No 
347; 13-21, 31-36, 56-67, 127-136, 153-171, 173-180, 184-200, 214-222, 225-231, 239-263, 267-273 and 135-159 
of Seq ID No 348; 12-27, 31-51, 68-74, 77-87, 94-101, 108-114, 117-123, 127-134, 138-168, 173-196, 201-207, 
212-217, 227-237, 247-257, 264-280 and 205-223 of Seq ID No 349; 17-22, 25-54, 70-76, 92-100 and 98-110 of 
Seq ID No 350; 7-29, 40-50, 60-67, 87-96, 105-111, 119-164, 172-199, 206-212, 220-227, 237-259, 272-279, 282- 
293, 295-309, 313-319, 321-328, 345-363, 376-386 and 159-176 of Seq ID No 351; 4-19, 24-30, 36-43, 50-68, 
71-89, 93-106, 141-152, 154-172, 179-197, 199-215, 229-239, 246-252, 255-263, 281-298, 319-325, 329-356, 358- 
368, 374-390, 397-409, 420-429, 432-444, 450-456, 459-475, 483-494, 496-502, 520-528, 532-556 and 362-377 of 
Seq ID No 352; 18-25, 40-62, 77-85, 91-97, 105-116, 123-133, 139-184, 189-197 and 122-140 of Seq ID No 
353; 4-49, 52-58, 62-70, 79-105, 109-133, 142-150, 163-168, 206-214, 220-228, 233-240, 243-254, 274-281, 303- 
311, 327-338, 357-373, 378-396, 403-413, 420-436, 441-453, 461-467, 475-481, 484-498, 506-512, 514-521, 523- 
529, 562-579, 589-595, 598-603, 615-648, 714-722, 728-742, 749-758, 777-792, 795-807 and 643-658 of Seq ID 
No 354; 8-27, 37-48, 51-56, 72-79, 87-106, 120-138, 140-147, 167-176, 187-197, 205-216, 222-229, 234-239, 243- 
249, 277-288, 292-315, 334-343, 347-353, 363-391, 398-404, 430-447, 461-467, 478-492, 498-507 and 456-470 of 
Seq ID No 355; 5-12, 18-24, 59-69, 80-93, 95-109, 119-125, 130-137, 139-147, 158-163, 168-176, 182-202, 206- 
215, 222-239, 241-249, 267-277, 291-298, 311-318, 321-327, 338-344, 348-355, 373-386, 393-406, 411-417, 434- 
443, 446-465, 473-484, 514-521, 532-553, 584-594 and 221-237 of Seq ID No 356; 4-14, 27-34, 50-58, 63-72, 
79-106, 109-114, 121-142, 146-154, 161-167, 169-175, 178-201, 223-238, 249-254, 259-264, 278-292, 294-312, 
319-330 and 167-191 of Seq ID No 357; 7-28, 36-42, 50-61, 63-80, 122-152, 161-174, 176-191 and 140-190 of 
Seq ID No 358; 20-57, 59-65, 70-78, 86-102, 119-133, 142-161, 163-173, 177-188, 192-202, 204-220, 222-236, 
240-253, 279-319, 326-331, 337-383, 390-399, 406-412, 420-427, 431-438 and 381-395 of Seq ID No 359; 13-18, 
28-34, 37-43, 50-59, 75-81, 83-97, 105-121, 139-147, 200-206, 209-227, 231-247, 260-271, 318-327, 366-381, 388- 
394, 399-406 and 182-201 of Seq ID No 360; 6-29, 37-43, 51-56, 70-77, 82-102, 110-119, 127-143, 178-190, 
201-209, 216-243, 261-269, 281-292, 305-313, 327-339, 341-354, 356-373, 391-397, 423-429, 438-445, 450-478 
and 21-314 of Seq ID No 361; 4-12, 15-21, 32-41, 59-76, 80-89, 96-104 and 90-103 of Seq ID No 362; 9-28, 
30-41, 44-54, 69-74, 77-82, 90-97, 104-123, 125-135, 149-155, 164-173, 177-184, 217-226, 230-235, 238-244, 258- 
272, 282-297, 300-305, 309-315, 317-322, 327-336, 348-362, 368-374, 380-387, 400-411, 414-424, 451-458, 460- 
466, 483-494, 497-503, 506-511, 521-528, 540-553, 569-587, 598-606, 628-642, 661-681, 688-700, 718-733, 740- 
749, 752-764, 769-783, 823-834, 848-854, 862-872, 878-884, 886-898, 915-920, 938-951, 954-961, 963-972, 982- 
989, 996-1003, 1010-1016, 1021-1032, 1038-1044, 1047-1057, 1060-1070, 1079-1088, 1094-1102, 1117-1127, 
1129-1135, 1142-1153, 1158-1204, 1212-1229, 1234-1263, 1269-1277, 1308-1313, 1327-1338, 1344-1376, 1400- 
1415, 1436-1443, 1448-1458, 1497-1504, 1511-1522, 1544-1566, 3-82 and 509-576 of Seq ID No 363; 8-36, 40- 
64, 71-79, 88-94, 102-109, 118-127, 138-148, 151-159, 163-174, 192-198, 200-206, 220-233, 268-273, 290-301, 
304-309, 316-323, 331-349, 378-391, 414-420, 427-437, 455-475, 494-510, 541-547, 549-555, 616-640, 1-60, 55- 
139, 212-308, 386-458 and 458-624 of Seq ID No 364; 16-31, 35-42, 70-77, 91-101, 120-130, 132-140, 143-153, 
185-190, 195-202, 215-222, 228-238, 241-251, 257-264, 268-277, 288-302, 312-324, 326-333, 341-348, 364-382, 
415-429, 438-454, 458-466, 491-499, 501-521 and 273-281 of Seq ID No 365; 8-14, 32-57, 74-149, 155-177, 
179-212, 221-266, 271-296, 304-324, 329-346, 349-359, 368-401, 413-419, 426-454, 465-478, 493-510 and 466- 
490 of Seq ID No 366; 22-28, 33-51, 64-89, 96-119, 126-132, 138-146, 152-159, 161-169, 172-179, 193-198, 205- 
211, 221-231, 235-254, 273-280, 297-303, 312-320, 328-346, 351-373, 378-384, 391-398, 448-454, 460-468, 470- 
481, 516-558, 574-593, 597-602, 613-623, 626-646, 649-656, 668-673, 675-683, 696-708, 715-722, 724-739, 745- 
751, 759-777, 780-804, 816-822 and 102-113 of Seq ID No 367; 12-28, 41-91, 98-107, 112-120, 125-131, 151- 
193, 215-221, 240-250, 263-280 and 128-138 of Seq ID No 368; 16-24, 32-38, 46-62, 68-81, 90-105, 127-133, 
144-150, 160-166, 178-184, 186-202, 210-219, 232-240, 252-258, 264-273, 293-324, 337-344, 349-357, 360-369, 
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385-398, 410-416, 419-427, 441-449, 458-476, 508-515, 523-539, 544-549, 562-569, 571-579, 96-109 and 127-139 
of Seq ID No 369; 19-25, 28-34, 56-61, 85-97, 110-116 and 39-53 of Seq ID No 370; 4-37, 41-50, 62-72, 91-97, 
99-109, 114-125, 136-141, 149-158, 160-166, 201-215 and 27-225 of Seq ID No 371; 15-31, 44-51, 96-105, 122- 
130, 149-157, 162-168, 178-183, 185-192, 198-204, 206-213, 221-234, 239-245, 248-255, 257-266, 289-335, 349- 
357, 415-422, 425-441, 448-454, 462-468 and 463-481 of Seq ID No 372; 5-31, 39-55, 63-72, 76-99, 106-155, 
160-177, 179-199, 207-217, 223-240, 245-255, 261-267, 294-316, 321-343, 354-378, 382-452, 477-488, 529-536, 
555-569, 584-591, 593-612, 620-627, 632-640, 647-654, 671-680, 698-704, 723-730, 732-750, 769-775, 781-788, 
822-852 and 505-525 of Seq ID No 373; 3-18 of Seq ID No 374; 4-14 and 12-24 of Seq ID No 375; 4-11, 22- 
30 and 12-25 of Seq ID No 376; 5-12 and 4-18 of Seq ID No 377; 4-28 and 7-14 of Seq ID No 378; 6-16 and 
8-16 of Seq ID No 379; 4-15, 18-33 and 24-36 of Seq ID No 380; 4-10, 16-21 and 20-31 of Seq ID No 381; 6- 
19 of Seq ID No 382; 11-18 and 3-10 of Seq ID No 383; 13-24 and 3-15 of Seq ID No 384; 15-27 and 7-16 of 
Seq ID No 385; 11-16 and 1-15 of Seq ID No 386; 4-16 and 9-21 of Seq ID No 387; 4-24, 40-48, 54-67 and 
22-39 of Seq ID No 388; 6-30, 34-55, 62-68, 78-106 and 68-74 of Seq ID No 389; 3-14 of Seq ID No 390; 9-19 
and 6-21 of Seq ID No 391; 4-17 and 1-9 of Seq ID No 392; 5-30 and 1-8 of Seq ID No 393; 4-16, 23-46, 51- 
56 and 45-55 of Seq ID No 394; 7-16 of Seq ID No 395; 2-14 of Seq ID No 396; 4-36, 43-65 and 50-62 of 
Seq ID No 397; 10-30 and 14-21 of Seq ID No 398; 9-17 and 1-10 of Seq ID No 399; 4-12 and 3-16 of Seq 
ID No 400; 4-15 and 5-23 of Seq ID No 401; 10-21 of Seq ID No 402; 6-16 of Seq ID No 403; 4-29, 31-38 
and 2-14 of Seq ID No 404; 4-35 and 33-42 of Seq ID No 405; 2-17 of Seq ID No 406; 9-18, 30-35 and 15-33 
of Seq ID No 407; 4-9 and 6-12 of Seq ID No 408; 3-17 of Seq ID No 409; 12-21, 37-44, 52-61, 72-80 and 38- 
48 of Seq ID No 410; 4-10, 29-44, 54-61, 69-78 and 13-27 of Seq ID No 411; 13-23, 36-53 and 2-15 of Seq ID 
No 412; 4-25, 28-46, 56-72, 81-99, 120-132, 134-142, 154-160 and 129-141 of Seq ID No 413; 4-15, 24-33, 35- 
41, 64-86 and 21-33 of Seq ID No 414; 9-15 and 4-13 of Seq ID No 415; 4-11, 13-19, 34-48 and 15-32 of Seq 
ID No 416; 4-21 and 11-31 of Seq ID No 417; 23-57 and 38-50 of Seq ID No 418; 4-32 and 3-13 of Seq ID 
No 419; 4-10, 13-25, 32-42, 56-68, 72-84 and 26-38 of Seq ID No 420; 4-20, 31-48, 52-58, 65-71, 80-93, 99-108, 
114-123 and 37-49 of Seq ID No 421; 6-12, 14-20 and 3-25 of Seq ID No 422; 14-25, 27-38 and 5-14 of Seq 
ID No 423; 4-41, 57-105, 109-118, 123-136, 144-152 and 86-99 of Seq ID No 424; 6-19 of Seq ID No 425; 2- 
19 of Seq ID No 426; 14-47 and 1-14 of Seq ID No 427; 4-21, 29-44 and 2-18 of Seq ID No 428; 23-29 and 
10-28 of Seq ID No 429; 6-16, 22-36 and 11-22 of Seq ID No 430; 4-19, 30-44 and 18-27 of Seq ID No 431; 
5-15, 37-45, 58-65 and 38-47 of Seq ID No 432; 4-15, 23-34 and 4-15 of Seq ID No 433; 30-36, 44-54, 79-85, 
101-114, 138-152, 154-164, 170-175, 179-200, 213-220, 223-240, 243-255, 258-264, 268-284 and 10-28 of Seq 
ID No 434; the peptides comprising amino acid sequences of column "Identical region" of the Table 
IB, especially peptides comprising amino acid 210-226 and 738-753 of Seq ID No 449; 326-344, 326-348, 
338-354, 371-392, 801-809 and 877-901 of Seq ID No 450; 893-906 of Seq ID No 451; 51-69 of Seq ID No 
452; 110-125 of Seq ID No 453; 291-305 of Seq ID No 454; 210-226 and 738-753 of Seq ID No 455; 326-344, 
326-348, 338-354, 371-392, 801-809 and 877-901 of Seq ID No 456; 893-906 of Seq ID No 457; 51-69 of Seq 
ID No 458; 110-125 of Seq ID No 459; 291-305 of Seq ID No 460; 32-44 of Seq ID No 461; 399-410 of Seq 
ID No 462; the serum reactive epitopes as specified in the column of "aa from" to "aa to" of Table 2, 
especially peptides comprising amino acid 120-143, 138-161 and 156-179 of Seq ID No 218; 110-129 and 
168-184 of Seq ID No 219; 74-90 of Seq ID No 222; 759-773 of Seq ID No 223; 237-260 of Seq ID No 224; 
265-284 of Seq ID No 225; 65-74 of Seq ID No 226; 41-50 of Seq ID No 227; 163-174 of Seq ID No 229; 26- 
37 of Seq ID No 230; 174-189 of Seq ID No 232; 240-256 of Seq ID No 234; 285-297 of Seq ID No 236; 238- 
247 of Seq ID No 238; 491-519 of Seq ID No 239; 114-140 of Seq ID No 243; 267-284 of Seq ID No 250; 
439-453 of Seq ID No 252; 162-178 of Seq ID No 253; 347-364 of Seq ID No 254; 699-715 of Seq ID No 
255; 60-71 of Seq ID No 256; 244-257 of Seq ID No 257; 44-63 and 57-76 of Seq ID No 258; 185-196 of Seq 
ID No 260; 119-129 of Seq ID No 263; 182-195 of Seq ID No 266; 32-44 and 424-442 of Seq ID No 267; 
247-256 of Seq ID No 268; 678-694, 785-805, 55-77 and 72-94 of Seq ID No 269; 210-226 of Seq ID No 281; 
37-59 of Seq ID No 289; 13-29 of Seq ID No 296; 136-159 of Seq ID No 348; 205-222 of Seq ID No 349; 99- 
110 of Seq ID No 350; 160-176 of Seq ID No 351; 457-470 of Seq ID No 355; 221-237 of Seq ID No 356; 
167-190 of Seq ID No 357; 96-120 of Seq ID No 361; 399-417, 503-519 and 544-563 of Seq ID No 364; 46- 
68, 159-183 and 184-198 of Seq ID No 371; 463-481 of Seq ID No 372; the immunogenic epitopes as 
specified in the column of "aa from" to "aa to" of Table 4; especially peptides comprising amino acid 
110-129 and 168-184 of Seq ID No 219; 877-901, 333-354, 326-344 and 801-809 of Seq ID No 277; 1-54 of 
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Seq ID No 347; 544-563, 31-51, 107-119, 399-417 and 503-519 of Seq ID No 364; 120-198 of Seq ID No 218; 
20-35 of Seq ID No 219; 118-201 of Seq ID No 221; 48-132 of Seq ID No 242; 118-136 of Seq ID No 249; 
162-178 of Seq ID No 253; 347-364 of Seq ID No 254; 699-715 of Seq ID No 255; 50-76 of Seq ID No 258; 
785-819 and 44-128 of Seq ID No 269; 90-128 of Seq ID No 274; 314-384 of Seq ID No 289; 327-349 of Seq 
ID No 293; 242-314, 405-478 and 23-100 of Seq ID No 304; 129-210 of Seq ID No 305; 162-188 of Seq ID 
No 307; 750-772 of Seq ID No 310; 1-56 of Seq ID No 335; 322-337 of Seq ID No 337; 72-90 of Seq ID No 
339; 374-395 of Seq ID No 345; 136-159 of Seq ID No 348; 141-164 of Seq ID No 358; 96-157 of Seq ID No 
36l'- 1-82 of Seq ID No 363; 489-556 of Seq ID No 364; 159-183 and 49-133 of Seq ID No 371; The 
peptides comprising amino acid sequences of column "predicted immunogenic aa" and "location of 
identified immunogenic region (aa)" of Table 5, especially peptides comprising amino acid 4-26, 35-41, 
53-61, 73-84, 103-108, 114-120, 140-146, 156-162, 192-208, 214-219, 227-233, 239-252, 260-268, 284-297, 1-48 
and 113-133 of Seq ID No 475; 4-27, 38-44, 50-56, 59-64, 72-79, 83-89, 92-97, 108-116, 123-148, 152-167, 183- 
196, 200-220, 232-244, 255-261, 265-274, 282-302, 309-317, 1-79 and 231-302 of Seq ID No 476; 6-28, 66-72, 
85-105, 115-121, 144-151, 160-170, 176-185, 223-230, 252-288, 296-310, 319-333, 367-374, 458-464, 471-480, 
483-488, 520-528, 530-549, 559-564, 593-601, 606-616, 636-643, 655-662, 676-682, 684-699, 719-726, 735-750, 
757-764, 777-785, 799-810, 812-843, 846-853, 868-873, 880-889, 891-899, 909-929, 934-940, 963-969, 998-1004, 
1007-1014, 1016-1022, 1030-1046, 1-80 and 808-821 of Seq ID No 477; 7-24, 35-41, 75-81, 91-114, 122-132, 
137-144, 148-156, 183-192, 194-200, 212-228, 233-238, 251-258, 275-295, 326-332, 337-346, 1-79 and 305-321 
of Seq ID No 478; 31-38, 42-52, 66-72, 86-92, 98-104, 115-122, 127-146, 154-164, 169-187, 198-212, 225-237, 
255-269, 13-92 and 135-142 of Seq ID No 479; 4-36, 39-49, 63-69, 71-77, 81-88, 123-131, 133-139, 160-169, 
174-180, 188-194, 210-217, 273-278, 289-300, 317-334, 336-341, 383-401, 425-438, 1-68, 212-270 and 402-446 
of Seq ID No 480; 21-29, 31-42, 49-63, 72-79, 81-93, 112-132, 159-165, 188-195, 197-232, 262-267, 279-286, 
294-301, 318-326, 348-366, 381-405, 409-426, 436-465, 471-480, 484-492, 497-505, 521-544, 554-561, 567-577, 
581-589, 601-609, 611-622, 636-651, 653-667, 669-685, 700-708, 716-722, 729-744, 749-766, 780-786, 789-811, 
814-864, 1-57 and 84-106 of Seq ID No 481; 6-24, 35-48, 57-63, 72-78, 87-92, 113-119, 123-137, 147-153, 173- 
181, 212-233 and 1-124 of Seq ID No 482; 13-34, 62-69, 78-83, 86-91, 98-104, 107-115, 146-159, 179-188, 195- 
205, 209-221, 226-233, 239-253, 276-282, 284-294, 297-308, 331-354, 375-382, 388-399, 421-433, 449-458, 464- 
469, 472-491, 508-513, 525-531, 534-550, 575-593, 601-618, 629-635, 654-661, 666-680, 706-721, 723-740, 771- 
805, 810-830, 845-851 and 1-84 of Seq ID No 483; 4-32, 45-64, 73-83, 86-92, 100-111, 125-147, 157-163, 170- 
175, 177-188, 226-232, 245-252, 258-274, 320-335, 348-359 and 1-71 of Seq ID No 484; 13-40, 43-71, 76-83, 
87-101, 109-119, 125-156, 162-175, 182-219, 226-232, 240-262, 270-287, 306-318, 326-342, 344-408, 414-444, 
449-456 and 1-51 of Seq ID No 485; 4-16, 18-34, 45-54, 99-108, 134-140, 203-212, 241-257, 266-274, 279-291, 
308-315, 330-336, 355-370, 374-382, 402-410, 428-455, 466-472, 474-480, 531-554, 560-566, 572-580, 597-618, 
632-660, 664-674, 676-685, 691-705, 708-735, 750-768, 1-87 and 342-480 of Seq ID No 486; The serum 
reactive epitopes as specified in the column of "aa from" to "aa to" of Table 6, especially peptides 
comprising amino acid 115-132 and 1-26 of Seq ID No 475; 33-55 of Seq ID No 476; 1-25 of Seq ID No 
478; 37-61 of Seq ID No 479; 1-24 of Seq ID No 480; 1-23 of Seq ID No 481; 46-60 of Seq ID No 482; 1-28, 
23-50 and 45-71 of Seq ID No 483; 1-22 and 17-38 of Seq ID No 484; 1-22 and 17-38 of Seq ID No 485; 1- 
27, 22-47 and 422-447 of Seq ID No 486; The immunogenic epitopes as specified in the column of "aa 
from" to "aa to" of Table 7, especially peptides comprising amino acid 115-132 and 1-47 of Seq ID No 
475; 1-55 of Seq ID No 476; 22-85 of Seq ID No 477; 307-320 and 1-44 of Seq ID No 478; 15-76 and 40-92 
of Seq ID No 479; 1-59, 213-269 and 403-445 of Seq ID No 480; 1-56 and 85-105 of Seq ID No 481; 37-121 
of Seq ID No 482; 1-71 of Seq ID No 483; 1-38 of Seq ID No 484; 1-38 of Seq ID No 485; 1-47 of Seq ID 
No 486 and fragments comprising at least 6, preferably more than 8, especially more than 10 aa and 
preferably not more than 70, 50, 40, 20, 15 or 11 aa of said sequences. All these fragments individually 
and each independently form a preferred selected aspect of the present invention. 

All linear hyperimmune serum reactive fragments of a particular antigen may be identified by analysing 
the entire sequence of the protein antigen by a set of peptides overlapping by 1 amino acid with a length 
of at least 10 amino acids. Subsequently, non-linear epitopes can be identified by analysis of the protein 
antigen with hyperimmune sera using the expressed full-length protein or domain polypeptides thereof. 
Assuming that a distinct domain of a protein is sufficient to form the 3D structure independent from the 



WO 2004/099242 



PCT/EP2004/004856 



-35- 

native protein, the analysis of the respective recombinant or synthetically produced domain polypeptide 
with hyperimmune serum would allow the identification of conformational epitopes within the 
individual domains of multi-domain proteins. For those antigens where a domain possesses linear as well 
as conformational epitopes, competition experiments with peptides corresponding to the linear epitopes 
may be used to confirm the presence of conformational epitopes. 

It will be appreciated that the invention also relates to, among others, nucleic acid molecules encoding the 
aforementioned fragments, nucleic acid molecules that hybridise to nucleic acid molecules encoding the 
fragments, particularly those that hybridise under stringent conditions, and nucleic acid molecules, such 
as PCR primers, for amplifying nucleic acid molecules that encode the fragments. In these regards, 
preferred nucleic acid molecules are those that correspond to the preferred fragments, as discussed 
above. 

The present invention also relates to vectors, which comprise a nucleic acid molecule or nucleic acid 
molecules of the present invention, host cells which are genetically engineered with vectors of the 
invention and the production of hyperimmune serum reactive antigens and fragments thereof by 
recombinant techniques. 

A great variety of expression vectors can be used to express a hyperimmune serum reactive antigen or 
fragment thereof according to the present invention. Generally, any vector suitable to maintain, 
propagate or express nucleic acids to express a polypeptide in a host may be used for expression in this 
regard. In accordance with this aspect of the invention the vector may be, for example, a plasmid vector, 
a single or double-stranded phage vector, a single or double-stranded RNA or DNA viral vector. Starting 
plasmids disclosed herein are either commercially available, publicly available, or can be constructed 
from available plasmids by routine application of well-known, published procedures. Preferred among 
vectors, in certain respects, are those for expression of nucleic acid molecules and hyperimmune serum 
reactive antigens or fragments thereof of the present invention. Nucleic acid constructs in host cells can 
be used in a conventional manner to produce the gene product encoded by the recombinant sequence. 
Alternatively, the hyperimmune serum reactive antigens and fragments thereof of the invention can be 
synthetically produced by conventional peptide synthesizers. Mature proteins can be expressed in 
mammalian cells, yeast, bacteria, or other cells under the control of appropriate promoters. Cell-free 
translation systems can also be employed to produce such proteins using RNAs derived from the DNA 
construct of the present invention. 

Host cells can be genetically engineered to incorporate nucleic acid molecules and express nucleic acid 
molecules of the present invention. Representative examples of appropriate hosts include bacterial cells, 
such as streptococci, staphylococci, E. coli, Streptomyces and Bacillus subtillis cells; fungal cells, such as 
yeast cells and Aspergillus cells; insect cells such as Drosophila S2 and Spodoptera Sf9 cells; animal cells 
such as CHO, COS, Hela, C127, 3T3, BHK, 293 and Bowes melanoma cells; and plant cells. 

The invention also provides a process for producing a S. agalactiae hyperimmune serum reactive antigen 
and a fragment thereof comprising expressing from the host cell a hyperimmune serum reactive antigen 
or fragment thereof encoded by the nucleic acid molecules provided by the present invention. The 
invention further provides a process for producing a cell, which expresses a S, agalactiae hyperimmune 
serum reactive antigen or a fragment thereof comprising transforming or transfecting a suitable host cell 
with the vector according to the present invention such that the transformed or transfected cell expresses 
the polypeptide encoded by the nucleic acid contained in the vector. 

The polypeptide may be expressed in a modified form, such as a fusion protein, and may include not 
only secretion signals but also additional heterologous functional regions. Thus, for instance, a region of 
additional amino acids, particularly charged amino acids, may be added to the N- or C-terminus of the 
polypeptide to improve stability and persistence in the host cell, during purification or during 
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subsequent handling and storage. Also, regions may be added to the polypeptide to facilitate 
purification. Such regions may be removed prior to final preparation of the polypeptide. The addition of 
peptide moieties to polypeptides to engender secretion or excretion, to improve stability or to facilitate 
purification, among others, are familiar and routine techniques in the art. A preferred fusion protein 
comprises a heterologous region from immunoglobulin that is useful to solubilize or purify pofypeptides. 
For example, EP-A-O 464 533 (Canadian counterpart 2045869) discloses fusion proteins comprising 
various portions of constant region of immunoglobin molecules together with another protein or part 
thereof. In drug discovery, for example, proteins have been fused with antibody Fc portions for the 
purpose of high-throughout screening assays to identify antagonists. See for example, {Bennett, D. et aL, 
1995} and {Johanson, K. et al., 1995}. 

The S. agalactiae hyperimmune serum reactive antigen or a fragment thereof can be recovered and 
purified from recombinant cell cultures by well-known methods including ammonium sulfate or ethanol 
precipitation, acid extraction, anion or cation exchange chromatography, phosphocellulose 
chromatography, hydrophobic interaction chromatography, hydroxylapatite chromatography and lectin 
chromatography. 

The hyperimmune serum reactive antigens and fragments thereof according to the present invention can 
be produced by chemical synthesis as well as by biotechnological means. The latter comprise the 
transfection or transformation of a host cell with a vector containing a nucleic acid according to the 
present invention and the cultivation of the transfected or transformed host cell under conditions, which 
are known to the ones skilled in the art. The production method may also comprise a purification step in 
order to purify or isolate the polypeptide to be manufactured. In a preferred embodiment the vector is a 
vector according to the present invention. 

The hyperimmune serum reactive antigens and fragments thereof according to the present invention may 
be used for the detection of the organism or organisms in a sample containing these organisms or 
polypeptides derived thereof. Preferably such detection is for diagnosis, more preferable for the diagnosis 
of a disease, most preferably for the diagnosis of a diseases related or linked to the presence or abundance 
of Gram-positive bacteria, especially bacteria selected from the group comprising streptococci, 
staphylococci and lactococci. More preferably, the microorganisms are selected from the group 
comprising Streptococcus pneumoniae, Streptococcus pyogenes and Streptococcus mutans, especially the 
microorganism is Streptococcus pyogenes. 

The present invention also relates to diagnostic assays such as quantitative and diagnostic assays for 
detecting levels of the hyperimmune serum reactive antigens and fragments thereof of the present 
invention in cells and tissues, including determination of normal and abnormal levels. Thus, for instance, 
a diagnostic assay in accordance with the invention for detecting over-expression of the polypeptide 
compared to normal control tissue samples may be used to detect the presence of an infection, for 
example, and to identify the infecting organism. Assay techniques that can be used to determine levels of 
a polypeptide, in a sample derived from a host are well known to those of skill in the art. Such assay 
methods include radioimmunoassays, competitive-binding assays, Western Blot analysis and ELISA 
assays. Among these, ELISAs frequently are preferred. An ELISA assay initially comprises preparing an 
antibody specific to the polypeptide, preferably a monoclonal antibody. In addition, a reporter antibody 
generally is prepared which binds to the monoclonal antibody. The reporter antibody is attached to a 
detectable reagent such as radioactive, fluorescent or enzymatic reagent, such as horseradish peroxidase 
enzyme. 

The hyperimmune serum reactive antigens and fragments thereof according to the present invention may 
also be used for the purpose of or in connection with an array. More particularly, at least one of the 
hyperimmune serum reactive antigens and fragments thereof according to the present invention may be 
immobilized on a support. Said support typically comprises a variety of hyperimmune serum reactive 
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antigens and fragments thereof whereby the variety may be created by using one or several of the 
hyperimmune serum reactive antigens and fragments thereof according to the present invention and/or 
hyperimmune serum reactive antigens and fragments thereof being different. The characterizing feature 
of such array as well as of any array in general is the fact that at a distinct or predefined region or 
position on said support or a surface thereof, a distinct polypeptide is immobilized. Because of this any 
activity at a distinct position or region of an array can be correlated with a specific polypeptide. The 
number of different hyperimmune serum reactive antigens and fragments thereof immobilized on a 
support may range from as little as 10 to several 1000 different hyperimmune serum reactive antigens 
and fragments thereof. The density of hyperimmune serum reactive antigens and fragments thereof per 
cm 2 is in a preferred embodiment as little as 10 peptides/polypeptides per cm 2 to at least 400 different 
peptides/polypeptides per cm 2 and more particularly at least 1000 different hyperimmune serum reactive 
antigens and fragments thereof per cm 2 . 

The manufacture of such arrays is known to the one skilled in the art and, for example, described in US 
patent 5,744,309. The array preferably comprises a planar, porous or non-porous solid support having at 
least a first surface. The hyperimmune serum reactive antigens and fragments thereof as disclosed herein, 
are immobilized on said surface. Preferred support materials are, among others, glass or cellulose. It is 
also within the present invention that the array is used for any of the diagnostic applications described 
herein. Apart from the hyperimmune serum reactive antigens and fragments thereof according to the 
present invention also the nucleic acid molecules according to the present invention may be used for the 
generation of an array as described above. This applies as well to an array made of antibodies, preferably 
monoclonal antibodies as, among others, described herein. 

In a further aspect the present invention relates to an antibody directed to any of the hyperimmune 
serum reactive antigens and fragments thereof, derivatives or fragments thereof according to the present 
invention. The present invention includes, for example, monoclonal and polyclonal antibodies, chimeric, 
single chain, and humanized antibodies, as well as Fab fragments, or the product of a Fab expression 
library. It is within the present invention that the antibody may be chimeric, i. e. that different parts 
thereof stem from different species or at least the respective sequences are taken from different species. 

Antibodies generated against the hyperimmune serum reactive antigens and fragments thereof 
corresponding to a sequence of the present invention can be obtained by direct injection of the 
hyperimmune serum reactive antigens and fragments thereof into an animal or by administering the 
hyperimmune serum reactive antigens and fragments thereof to an animal, preferably a non-human. The 
antibody so obtained will then bind the hyperimmune serum reactive antigens and fragments thereof 
itself. In this manner, even a sequence encoding only a fragment of a hyperimmune serum reactive 
antigen and fragments thereof can be used to generate antibodies binding the whole native hyperimmune 
serum reactive antigen and fragments thereof. Such antibodies can then be used to isolate the 
hyperimmune serum reactive antigens and fragments thereof from tissue expressing those hyperimmune 
serum reactive antigens and fragments thereof. 

For preparation of monoclonal antibodies, any technique known in the art, which provides antibodies 
produced by continuous cell line cultures can be used (as described originally in {Kohler, G. et al., 1975}. 

Techniques described for the production of single chain antibodies (U.S. Patent No. 4,946,778) can be 
adapted to produce single chain antibodies to immunogenic hyperimmune serum reactive antigens and 
fragments thereof according to this invention. Also, transgenic mice, or other organisms such as other 
mammals, may be used to express humanized antibodies to immunogenic hyperimmune serum reactive 
antigens and fragments thereof according to this invention. 

Alternatively, phage display technology or ribosomal display could be utilized to select antibody genes 
with binding activities towards the hyperimmune serum reactive antigens and fragments thereof either 
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from repertoires of PGR amplified v-genes of lymphocytes from humans screened for possessing 
respective target antigens or from naive libraries {McCafferty, J. et al., 1990}; {Marks, J. et al., 1992}. The 
affinity of these antibodies can also be improved by chain shuffling {Clackson, T. et al., 1991}. 

If two antigen binding domains are present, each domain may be directed against a different epitope - 
termed 'bispecific' antibodies. 

The above-described antibodies may be employed to isolate or to identify clones expressing the 
hyperimmune serum reactive antigens and fragments thereof or purify the hyperimmune serum reactive 
antigens and fragments thereof of the present invention by attachment of the antibody to a solid support 
for isolation and/or purification by affinity chromatography. 

Thus, among others, antibodies against the hyperimmune serum reactive antigens and fragments thereof 
of the present invention may be employed to inhibit and/or treat infections, particularly bacterial 
infections and especially infections arising from S. agalactiae. 

Hyperimmune serum reactive antigens and fragments thereof include antigenically, epitopically or 
immunologically equivalent derivatives, which form a particular aspect of this invention. The term 
"antigenically equivalent derivative" as used herein encompasses a hyperimmune serum reactive antigen 
and fragments thereof or its equivalent which will be specifically recognized by certain antibodies which, 
when raised to the protein or hyperimmune serum reactive antigen and fragments thereof according to 
the present invention, interfere with the interaction between pathogen and mammalian host. The term 
"immunologically equivalent derivative" as used herein encompasses a peptide or its equivalent which 
when used in a suitable formulation to raise antibodies in a vertebrate, the antibodies act to interfere with 
the interaction between pathogen and mammalian host. 

The hyperimmune serum reactive antigens and fragments thereof, such as an antigenically or 
immunologically equivalent derivative or a fusion protein thereof can be used as an antigen to immunize 
a mouse or other animal such as a rat or chicken. The fusion protein may provide stability to the 
hyperimmune serum reactive antigens and fragments thereof. The antigen may be associated, for 
example by conjugation, with an immunogenic carrier protein, for example bovine serum albumin (BSA) 
or keyhole limpet haemocyanin (KLH). Alternatively, an antigenic peptide comprising multiple copies of 
the protein or hyperimmune serum reactive antigen and fragments thereof, or an antigenically or 
immunologically equivalent hyperimmune serum reactive antigen and fragments thereof, may be 
sufficiently antigenic to improve immunogenicity so as to obviate the use of a carrier. 

Preferably the antibody or derivative thereof is modified to make it less immunogenic in the individual. 
For example, if the individual is human the antibody may most preferably be "humanized", wherein the 
complimentarity determining region(s) of the hybridoma-derived antibody has been transplanted into a 
human monoclonal antibody, for example as described in {Jones, P. et al., 1986} or {Tempest, P. et al., 
1991}. 

The use of a polynucleotide of the invention in genetic immunization will preferably employ a suitable 
delivery method such as direct injection of plasmid DNA into muscle, delivery of DNA complexed with 
specific protein carriers, coprecipitation of DNA with calcium phosphate, encapsulation of DNA in 
various forms of liposomes, particle bombardment {Tang, D. et al., 1992}, {Eisenbraun, M. et al., 1993} and 
in vivo infection using cloned retroviral vectors {Seeger, C. et al., 1984}. 

In a further aspect the present invention relates to a peptide binding to any of the hyperimmune serum 
reactive antigens and fragments thereof according to the present invention, and a method for the 
manufacture of such peptides whereby the method is characterized by the use of the hyperimmune 
serum reactive antigens and fragments thereof according to the present invention and the basic steps are 
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known to the one skilled in the art. 

Such peptides may be generated by using methods according to the state of the art such as phage display 
or ribosome display. In case of phage display, basically a library of peptides is generated, in form of 
phages, and this kind of library is contacted with the target molecule, in the present case a hyperimmune 
serum reactive antigen and fragments thereof according to the present invention. Those peptides binding 
to the target molecule are subsequently removed, preferably as a complex with the target molecule, from 
the respective reaction. It is known to the one skilled in the art that the binding characteristics, at least to a 
certain extent, depend on the particularly realized experimental set-up such as the salt concentration and 
the like. After separating those peptides binding to the target molecule with a higher affinity or a bigger 
force, from the non-binding members of the library, and optionally also after removal of the target 
molecule from the complex of target molecule and peptide, the respective peptide(s) may subsequently 
be characterised. Prior to the characterisation optionally an amplification step is realized such as, e. g. by 
propagating the peptide encoding phages. The characterisation preferably comprises the sequencing of 
the target binding peptides. Basically, the peptides are not limited in their lengths, however, preferably 
peptides having a lengths from about 8 to 20 amino acids are preferably obtained in the respective 
methods. The size of the libraries may be about 10 2 to 10 18 , preferably 10* to lO* different peptides, 
however, is not limited thereto. 

A particular form of target binding hyperimmune serum reactive antigens and fragments thereof are the 
so-called "anticalines" which are, among others, described in German patent application DE 197 42 706. 

In a further aspect the present invention relates to functional nucleic acids interacting with any of the 
hyperimmune serum reactive antigens and fragments thereof according to the present invention, and a 
method for the manufacture of such functional nucleic acids whereby the method is characterized by the 
use of the hyperimmune serum reactive antigens and fragments thereof according to the present 
invention and the basic steps are known to the one skilled in the art. The functional nucleic acids are 
preferably aptamers and spiegelmers. 

Aptamers are D-nucleic acids, which are either single stranded or double stranded and which specifically 
interact with a target molecule. The manufacture or selection of aptamers is, e.g. described in European 
patent EP 0 533 838. Basically the following steps are realized. First, a mixture of nucleic acids, i. e. 
potential aptamers, is provided whereby each nucleic acid typically comprises a segment of several, 
preferably at least eight subsequent randomised nucleotides. This mixture is subsequently contacted with 
the target molecule whereby the nucleic acid(s) bind to the target molecule, such as based on an increased 
affinity towards the target or with a bigger force thereto, compared to the candidate mixture. The binding 
nucleic acid(s) are/is subsequently separated from the remainder of the mixture. Optionally, the thus 
obtained nucleic acid(s) is amplified using, e.g. polymerase chain reaction. These steps may be repeated 
several times giving at the end a mixture having an increased ratio of nucleic acids specifically binding to 
the target from which the final binding nucleic acid is then optionally selected. These specifically binding 
nucleic acid(s) are referred to as aptamers. It is obvious that at any stage of the method for the generation 
or identification of the aptamers samples of the mixture of individual nucleic acids may be taken to 
determine the sequence thereof using standard techniques. It is within the present invention that the 
aptamers may be stabilized such as, e. g., by introducing defined chemical groups which are known to 
the one skilled in the art of generating aptamers. Such modification may for example reside in the 
introduction of an amino group at the 2' -position of the sugar moiety of the nucleotides. Aptamers are 
currently used as therapeutical agents. However, it is also within the present invention that the thus 
selected or generated aptamers may be used for target validation and/or as lead substance for the 
development of medicaments, preferably of medicaments based on small molecules. This is actually done 
by a competition assay whereby the specific interaction between the target molecule and the aptamer is 
inhibited by a candidate drug whereby upon replacement of the aptamer from the complex of target and 
aptamer it may be assumed that the respective drug candidate allows a specific inhibition of the 
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interaction between target and aptamer, and if the interaction is specific, said candidate drug will, at least 
in principle, be suitable to block the target and thus decrease its biological availability or activity in a 
respective system comprising such target. The thus obtained small molecule may then be subject to 
further derivatisation and modification to optimise its physical, chemical, biological and/or medical 
characteristics such as toxicity, specificity, biodegradability and bioavailability. 

Spiegelmers and their generation or manufacture is based on a similar principle. The manufacture of 
spiegelmers is described in international patent application WO 98/08856. Spiegelmers are L-nucleic 
acids, which means that they are composed of L-nucleotides rather than D-nucleotides as aptamers are. 
Spiegelmers are characterized by the fact that they have a very high stability in biological systems and, 
comparable to aptamers, specifically interact with the target molecule against which they are directed. In 
the process of generating spiegelmers, a heterogeonous population of D-nucleic acids is created and this 
population is contacted with the optical ant ip ode of the target molecule, in the present case for example 
with the D-enantiomer of the naturally occurring L-enantiomer of the hyperimmune serum reactive 
antigens and fragments thereof according to the present invention. Subsequently, those D-nucleic acids 
are separated which do not interact with the optical antipode of the target molecule. But those D-nucleic 
acids interacting with the optical antipode of the target molecule are separated, optionally identified 
and/or sequenced and subsequently the corresponding L-nucleic acids are synthesized based on the 
nucleic acid sequence information obtained from the D-nucleic acids. These L-nucleic acids which are 
identical in terms of sequence with the aforementioned D-nucleic acids interacting with the optical 
antipode of the target molecule, will specifically interact with the naturally occurring target molecule 
rather than with the optical antipode thereof. Similar to the method for the generation of aptamers it is 
also possible to repeat the various steps several times and thus to enrich those nucleic acids specifically 
interacting with the optical antipode of the target molecule. 

In a further aspect the present invention relates to functional nucleic acids interacting with any of the 
nucleic acid molecules according to the present invention, and a method for the manufacture of such 
functional nucleic acids whereby the method is characterized by the use of the nucleic acid molecules and 
their respective sequences according to the present invention and the basic steps are known to the one 
skilled in the art. The functional nucleic acids are preferably ribozymes, antisense oligonucleotides and 
siRNA. 

Ribozymes are catalytically active nucleic acids, which preferably consist of RNA, which basically 
comprises two moieties. The first moiety shows a catalytic activity whereas the second moiety is 
responsible for the specific interaction with the target nucleic acid, in the present case the nucleic acid 
coding for the hyperimmune serum reactive antigens and fragments thereof according to the present 
invention. Upon interaction between the target nucleic acid and the second moiety of the ribozyme, 
typically by hybridisation and Watson-Crick base pairing of essentially complementary stretches of bases 
on the two hybridising strands, the catalytically active moiety may become active which means that it 
catalyses, either intramolecularly or intermolecularly, the target nucleic acid in case the catalytic activity 
of the ribozyme is a phosphodiesterase activity. Subsequently, there may be a further degradation of the 
target nucleic acid, which in the end results in the degradation of the target nucleic acid as well as the 
protein derived from the said target nucleic acid. Ribozymes, their use and design principles are known 
to the one skilled in the art, and, for example described in {Doherty, E. et al., 2001} and {Lewin, A. et al., 
2001}. 

The activity and design of antisense oligonucleotides for the manufacture of a medicament and as a 
diagnostic agent, respectively, is based on a similar mode of action. Basically, antisense oligonucleotides 
hybridise based on base complementarity, with a target RNA, preferably with a mRNA, thereby 
activating RNase H. RNase H is activated by both phosphodiester and phosphorothioate-coupled DNA. 
Phosphodiester-coupled DNA, however, is rapidly degraded by cellular nucleases with the exception of 
phosphorothioate-coupled DNA. These resistant, non-naturally occurring DNA derivatives do not inhibit 
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RNase H upon hybridisation with RNA. In other words, antisense polynucleotides are only effective as 
DNA RNA hybride complexes. Examples for this kind of antisense oligonucleotides are described, 
among others, in US-patent US 5,849,902 and US 5,989,912. In other words, based on the nucleic acid 
sequence of the target molecule which in the present case are the nucleic acid molecules for the 
hyperimmune serum reactive antigens and fragments thereof according to the present invention, either 
from the target protein from which a respective nucleic acid sequence may in principle be deduced, or by 
knowing the nucleic acid sequence as such, particularly the mRNA, suitable antisense oligonucleotides 
may be designed base on the principle of base complementarity. 

Particularly preferred are antisense-oligonucleotides, which have a short stretch of phosphorothioate 
DNA (3 to 9 bases). A minimum of 3 DNA bases is required for activation of bacterial RNase H and a 
minimum of 5 bases is required for mammalian RNase H activation. In these chimeric oligonucleotides 
there is a central region that forms a substrate for RNase H that is flanked by hybridising "arms" 
comprised of modified nucleotides that do not form substrates for RNase H. The hybridising arms of the 
chimeric oligonucleotides may be modified such as by 2'-0-methyl or 2'~fluoro. Alternative approaches 
used methylphosphonate or phosphoramidate linkages in said arms. Further embodiments of the 
antisense oligonucleotide useful in the practice of the present invention are P-methoxyoligonucleotides, 
partial P-methoxyoligodeoxyribonucleotides or P-methoxyoligonucleotides. 

Of particular relevance and usefulness for the present invention are those antisense oligonucleotides as 
more particularly described in the above two mentioned US patents. These oligonucleotides contain no 
naturally occurring 5'-»3'-linked nucleotides. Rather the oligonucleotides have two types of nucleotides: 
2'-deoxyphosphorothioate, which activate RNase H, and 2'-modified nucleotides, which do not. The 
linkages between the 2'-modified nucleotides can be phosphodiesters, phosphorothioate or P- 
ethoxyphosphodiester. Activation of RNase H is accomplished by a contiguous RNase H-activating 
region, which contains between 3 and 5 2'-deoxyphosphorothioate nucleotides to activate bacterial RNase 
H and between 5 and 10 2'- deoxyphosphorothioate nucleotides to activate eucaryotic and, particularly, 
mammalian RNase H. Protection from degradation is accomplished by making the 5' and 3' terminal 
bases highly nuclease resistant and, optionally, by placing a 3' terminal blocking group. 

More particularly, the antisense oligonucleotide comprises a 5' terminus and a 3' terminus; and from 
position 11 to 59 5'->3'-linked nucleotides independently selected from the group consisting of 2'~ 
modified phosphodiester nucleotides and 2 '-modified P-alkyloxyphosphotriester nucleotides; and 
wherein the 5 '-terminal nucleoside is attached to an RNase H-activating region of between three and ten 
contiguous phosphorothioate-linked deoxyribonucleotides, and wherein the 3'~terminus of said 
oligonucleotide is selected from the group consisting of an inverted deoxyribonucleotide, a contiguous 
stretch of one to three phosphorothioate 2 '-modified ribonucleotides, a biotin group and a P- 
alkyloxyphosphotriester nucleotide. 

Also an antisense oligonucleotide may be used wherein not the 5' terminal nucleoside is attached to an 
RNase H-activating region but the 3' terminal nucleoside as specified above. Also, the 5' terminus is 
selected from the particular group rather than the 3' terminus of said oligonucleotide. 

The nucleic acids as well as the hyperimmune serum reactive antigens and fragments thereof according 
to the present invention may be used as or for the manufacture of pharmaceutical compositions, 
especially vaccines. Preferably such pharmaceutical composition, preferably vaccine is for the prevention 
or treatment of diseases caused by, related to or associated with S. agalactiae. In so far another aspect of 
the invention relates to a method for inducing an immunological response in an individual, particularly a 
mammal, which comprises inoculating the individual with the hyperimmune serum reactive antigens 
and fragments thereof of the invention, or a fragment or variant thereof, adequate to produce antibodies 
to protect said individual from infection, particularly streptococcal infection and most particularly S. 
agalactiae infections. 
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Yet another aspect of the invention relates to a method of inducing an immunological response in an 
individual which comprises, through gene therapy or otherwise, delivering a nucleic acid functionally 
encoding hyperimmune serum reactive antigens and fragments thereof, or a fragment or a variant 
thereof, for expressing the hyperimmune serum reactive antigens and fragments thereof, or a fragment or 
a variant thereof in vivo in order to induce an immunological response to produce antibodies or a cell 
mediated T cell response, either cytokine-producing T cells or cytotoxic T cells, to protect said individual 
from disease, whether that disease is already established within the individual or not. One-way of 
administering the gene is by accelerating it into the desired cells as a coating on particles or otherwise. 

A further aspect of the invention relates to an immunological composition which, when introduced into a 
host capable of having induced within it an immunological response, induces an immunological response 
in such host, wherein the composition comprises recombinant DNA which codes for and expresses an 
antigen of the hyperimmune serum reactive antigens and fragments thereof of the present invention. The 
immunological response may be used therapeutically or prophylactically and may take the form of 
antibody immunity or cellular immunity such as that arising from CTL or CD4+ T cells. 

The hyperimmune serum reactive antigens and fragments thereof of the invention or a fragment thereof 
may be fused with a co-protein which may not by itself produce antibodies, but is capable of stabilizing 
the first protein and producing a fused protein which will have immunogenic and protective properties. 
This fused recombinant protein preferably further comprises an antigenic co-protein, such as 
Glutathione-S-transferase (GST) or beta-galactosidase, relatively large co-proteins which solubilise the 
protein and facilitate production and purification thereof. Moreover, the co-protein may act as an 
adjuvant in the sense of providing a generalized stimulation of the immune system. The co-protein may 
be attached to either the amino or carboxy terminus of the first protein. 

Also, provided by this invention are methods using the described nucleic acid molecule or particular 
fragments thereof in such genetic immunization experiments in animal models of infection with S. 
agalactiae. Such fragments will be particularly useful for identifying protein epitopes able to provoke a 
prophylactic or therapeutic immune response. This approach can allow for the subsequent preparation of 
monoclonal antibodies of particular value from the requisite organ of the animal successfully resisting or 
clearing infection for the development of prophylactic agents or therapeutic treatments of S. agalactiae 
infection in mammals, particularly humans. 

The hyperimmune serum reactive antigens and fragments thereof may be used as an antigen for 
vaccination of a host to produce specific antibodies which protect against invasion of bacteria, for 
example by blocking adherence of bacteria to damaged tissue. Examples of tissue damage include 
wounds in skin or connective tissue and mucosal tissues caused e.g. by viral infection (esp. respiratory, 
such as the flu) mechanical, chemical or thermal damage or by implantation of indwelling devices, or 
wounds in the mucous membranes, such as the mouth, mammary glands, urethra or vagina. 

The present invention also includes a vaccine formulation, which comprises the immunogenic 
recombinant protein together with a suitable carrier. Since the protein may be broken down in the 
stomach, it is preferably administered parenterally, including, for example, administration that is 
subcutaneous, intramuscular, intravenous, intradermal intranasal or tramsdermal. Formulations suitable 
for parenteral administration include aqueous and non-aqueous sterile injection solutions which may 
contain anti-oxidants, buffers, bacteriostats and solutes which render the formulation isotonic with the 
bodily fluid, preferably the blood, of the individual; and aqueous and non-aqueous sterile suspensions 
which may include suspending agents or thickening agents. The formulations may be presented in unit- 
dose or multi-dose containers, for example, sealed ampoules and vials, and may be stored in a freeze- 
dried condition requiring only the addition of the sterile liquid carrier immediately prior to use. The 
vaccine formulation may also include adjuvant systems for enhancing the immunogenicity of the 
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formulation, such as oiHn-water systems and other systems known in the art. The dosage will depend on 
the specific activity of the vaccine and can be readily determined by routine experimentation. 

According to another aspect, the present invention relates to a pharmaceutical composition comprising 
such a hyperimmune serum-reactive antigen or a fragment thereof as provided in the present invention 
for S. agalactiae. Such a pharmaceutical composition may comprise one preferably at least two or more 
hyperimmune serum reactive antigens or fragments thereof against S. agalactiae. Optionally, such S. 
agalactiae hyperimmune serum reactive antigens or fragments thereof may also be combined with 
antigens against other pathogens in a combination pharmaceutical composition. Preferably, said 
pharmaceutical composition is a vaccine for preventing or treating an infection caused by S. agalactiae 
and/or other pathogens against which the antigens have been included in the vaccine. 

According to a further aspect, the present invention relates to a pharmaceutical composition comprising a 
nucleic acid molecule encoding a hyperimmune serum-reactive antigen or a fragment thereof as 
identified above for S. agalactiae. Such a pharmaceutical composition may comprise one or more nucleic 
acid molecules encoding hyperimmune serum reactive antigens or fragments thereof against S. agalactiae. 
Optionally, such S. agalactiae nucleic acid molecules encoding hyperimmune serum reactive antigens or 
fragments thereof may also be combined with nucleic acid molecules encoding antigens against other 
pathogens in a combination pharmaceutical composition. Preferably, said pharmaceutical composition is 
a vaccine for preventing or treating an infection caused by S. agalactiae and/or other pathogens against 
which the antigens have been included in the vaccine. 

The pharmaceutical composition may contain any suitable auxiliary substances, such as buffer 
substances, stabilisers or further active ingredients, especially ingredients known in connection of 
pharmaceutical composition and/or vaccine production. 

A preferable carrier/or excipient for the hyperimmune serum-reactive antigens, fragments thereof or a 
coding nucleic acid molecule thereof according to the present invention is an immunostimulatory 
compound for further stimulating the immune response to the given hyperimmune serum-reactive 
antigen, fragment thereof or a coding nucleic acid molecule thereof. Preferably the immunostimulatory 
compound in the pharmaceutical preparation according to the present invention is selected from the 
group of polycationic substances, especially polycationic peptides, immunostimulatory nucleic acids 
molecules, preferably immunostimulatory deoxynucleotides, alum, Freund's complete adjuvants, 
Freund's incomplete adjuvants, neuroactive compounds, especially human growth hormone, or 
combinations thereof. 

It is also within the scope of the present invention that the pharmaceutical composition, especially 
vaccine, comprises apart from the hyperimmune serum reactive antigens, fragments thereof and/or 
coding nucleic acid molecules thereof according to the present invention other compounds which are 
biologically or pharmaceutically active. Preferably, the vaccine composition comprises at least one 
polycationic peptide. Trie polycationic compound(s) to be used according to the present invention may be 
any polycationic compound, which shows the characteristic effects according to the WO 97/30721. 
Preferred polycationic compounds are selected from basic polyppetides, organic polycations, basic 
polyamino acids or mixtures thereof. These polyamino acids should have a chain length of at least 4 
amino acid residues (WO 97/30721). Especially preferred are substances like polylysine, polyarginine and 
polypeptides containing more than 20 %, especially more than 50 % of basic amino acids in a range of 
more than 8, especially more than 20, amino acid residues or mixtures thereof. Other preferred 
polycations and their pharmaceutical compositions are described in WO 97/30721 (e.g. 
polyethyleneimine) and WO 99/38528. Preferably these polypeptides contain between 20 and 500 amino 
acid residues, especially between 30 and 200 residues. 

These polycationic compounds may be produced chemically or recombinantly or may be derived from 
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natural sources. 

Cationic (poly)peptides may also be anti-microbial with properties as reviewed in {Ganz, T., 1999}. These 
(poiy)peptides may be of prokaryotic or animal or plant origin or may be produced chemically or 
recombinantly (WO 02/13857).- Peptides may also belong to the class of defensins (WO 02/13857). 
Sequences of such peptides can be, for example, found in the Antimicrobial Sequences Database under 
the following internet address: 

http://www.bbcm.univ.trieste.it/-tossi/pag2.html 

Such host defence peptides or defensives are also a preferred form of the polycationic polymer according 
to the present invention. Generally, a compound allowing as an end product activation (or down- 
regulation) of the adaptive immune system, preferably mediated by APCs (including dendritic cells) is 
used as polycationic polymer. 

Especially preferred for use as polycationic substances in the present invention are cathelicidin derived 
antimicrobial peptides or derivatives thereof (International patent application WO 02/13857, incorporated 
herein by reference), especially antimicrobial peptides derived from mammalian cathelicidin, preferably 
from human, bovine or mouse. 

Polycationic compounds derived from natural sources include HIV-REV or HIV-TAT (derived cationic 
peptides, antennapedia peptides, chitosan or other derivatives of cliitin) or other peptides derived from 
these peptides or proteins by biochemical or recombinant production. Other preferred polycationic 
compounds are cathelin or related or derived substances from cathelin. For example, mouse cathelin is a 
peptide which has the amino acid sequence NH2-RLAGLLRKGGEKIGEKLKKIGOKIKNFFQKLVPQPE- 
COOH. Related or derived cathelin substances contain the whole or parts of the cathelin sequence with at 
least 15-20 amino acid residues. Derivations may include the substitution or modification of the natural 
amino acids by amino acids which are not among the 20 standard amino acids. Moreover, further cationic 
residues may be introduced into such cathelin molecules. These cathelin molecules are preferred to be 
combined with the antigen. These cathelin molecules surprisingly have turned out to be also effective as 
an adjuvant for an antigen without the addition of further adjuvants. It is therefore possible to use such 
cathelin molecules as efficient adjuvants in vaccine formulations with or without further 
immunactivating substances. 

Another preferred polycationic substance to be used according to the present invention is a synthetic 
peptide containing at least 2 KLK-motifs separated by a linker of 3 to 7 hydrophobic amino acids 
(International patent application WO 02/32451, incorporated herein by reference). 

The pharmaceutical composition of the present invention may further comprise immunostimulatory 
nucleic acid(s). Immunostimulatory nucleic acids are e. g. neutral or artificial CpG containing nucleic 
acids, short stretches of nucleic acids derived from non-vertebrates or in form of short oligonucleotides 
(ODNs) containing non-methylated cytosine-guanine di-nucleotides (CpG) in a certain base context (e.g. 
described in WO 96/02555). Alternatively, also nucleic acids based on inosine and cytidine as e.g. 
described in the WO 01/93903, or deoxynucleic acids containing deoxy-inosine and/or deoxyuridine 
residues (described in WO 01/93905 and PCT/EP 02/05448, incorporated herein by reference) may 
preferably be used as immunostimulatory nucleic acids for the present invention. Preferablly, the 
mixtures of different immunostimulatory nucleic acids may be used according to the present invention. 

It is also within the present invention that any of the aforementioned polycationic compounds is 
combined with any of the immunostimulatory nucleic acids as aforementioned. Preferably, such 
combinations are according to the ones as described in WO 01/93905, WO 02/32451, WO 01/54720, WO 
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01/93903, WO 02/13857 and PCT/EP 02/05448 and the Austrian patent application A 1924/2001, 
incorporated herein by reference. 

In addition or alternatively such vaccine composition may comprise apart from the hyperimmune serum 
reactive antigens and fragments thereof, and the coding nucleic acid molecules thereof according to the 
present invention a neuroactive compound. Preferably, the neuroactive compound is human growth 
factor as, e.g. described in WO 01/24822. Also preferably, the neuroactive compound is combined with 
any of the polycationic compounds and/or immunostimulatory nucleic acids as afore-mentioned. 

In a further aspect the present invention is related to a pharmaceutical composition. Such pharmaceutical 
composition is, for example, the vaccine described herein. Also a pharmaceutical composition is a 
pharmaceutical composition which comprises any of the following compounds or combinations thereof: 
the nucleic acid molecules according to the present invention, the hyperimmune serum reactive antigens 
and fragments thereof according to the present invention, the vector according to the present invention, 
the cells according to the present invention, the antibody according to the present invention, the 
functional nucleic acids according to the present invention and the binding peptides such as the 
anticalines according to the present invention, any agonists and antagonists screened as described herein. 
In connection therewith any of these compounds may be employed in combination with a non-sterile or 
sterile carrier or carriers for use with cells, tissues or organisms, such as a pharmaceutical carrier suitable 
for administration to a subject. Such compositions comprise, for instance, a media additive or a 
therapeutically effective amount of a hyperimmune serum reactive antigen and fragments thereof of the 
invention and a pharmaceutical^ acceptable carrier or excipient. Such carriers may include, but are not 
limited to, saline, buffered saline, dextrose, water, glycerol, ethanol and combinations thereof. The 
formulation should suit the mode of administration. 

The pharmaceutical compositions may be administered in any effective, convenient manner including, 
for instance, administration by topical, oral, anal, vaginal, intravenous, intraperitoneal, intramuscular, 
subcutaneous, intranasal, intratracheal or intradermal routes among others. 

In therapy or as a prophylactic, the active agent may be administered to an individual as an injectable 
composition, for example as a sterile aqueous dispersion, preferably isotonic. 

Alternatively the composition may be formulated for topical application, for example in the form of 
ointments, creams, lotions, eye ointments, eye drops, ear drops, mouthwash, impregnated dressings and 
sutures and aerosols, and may contain appropriate conventional additives, including, for example, 
preservatives, solvents to assist drug penetration, and emollients in ointments and creams. Such topical 
formulations may also contain compatible conventional carriers, for example cream or ointment bases, 
and ethanol or oleyl alcohol for lotions. Such carriers may constitute from about 1 % to about 98 % by 
weight of the formulation; more usually they will constitute up to about 80 % by weight of the 
formulation. 

In addition to the therapy described above, the compositions of this invention may be used generally as a 
wound treatment agent to prevent adhesion of bacteria to matrix proteins exposed in wound tissue and 
for prophylactic use in dental treatment as an alternative to, or in conjunction with, antibiotic 
prophylaxis. 

A vaccine composition is conveniently in injectable form. Conventional adjuvants may be employed to 
enhance the immune response. A suitable unit dose for vaccination is 0.05-5 ug antigen / per kg of body 
weight, and such dose is preferably administered 1-3 times and with an interval of 1-3 weeks. 

With the indicated dose range, no adverse toxicological effects should be observed with the compounds 
of the invention, which would preclude their administration to suitable individuals. 
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In a further embodiment the present invention relates to diagnostic and pharmaceutical packs and kits 
comprising one or more containers filled with one or more of the ingredients of the aforementioned 
compositions of the invention. The ingredient(s) can be present in a useful amount, dosage, formulation 
or combination. Associated with such container(s) can be a notice in the form prescribed by a 
governmental agency regulating the manufacture, use or sale of pharmaceuticals or biological products, 
reflecting approval by the agency of the manufacture, use or sale of the product for human 
administration. 

In connection with the present invention any disease related use as disclosed herein such as, e. g. use of 
the pharmaceutical composition or vaccine, is particularly a disease or diseased condition which is 
caused by, linked or associated with Streptococci, more preferably, S. pyogens and pneumoniae. In 
connection therewith it is to be noted that S. agalactiae comprises several strains including those disclosed 
herein. A disease related, caused or associated with the bacterial infection to be prevented and/or treated 
according to the present invention includes besides others bacterial pharyngitis, otitis media, pneumonia, 
bacteremia, meningitis, peritonitis, wound infection and sepsis in humans. 

In a still further embodiment the present invention is related to a screening method using any of the 
hyperimmune serum reactive antigens or nucleic acids according to the present invention. Screening 
methods as such are known to the one skilled in the art and can be designed such that an agonist or an 
antagonist is screened. Preferably an antagonist is screened which in the present case inhibits or prevents 
the binding of any hyperimmune serum reactive antigen and fragment thereof according to the present 
invention to an interaction partner. Such interaction partner can be a naturally occurring interaction 
partner or a non-naturally occurring interaction partner. 

The invention also provides a method of screening compounds to identify those, which enhance (agonist) 
or block (antagonist) the function of hyperimmune serum reactive antigens and fragments thereof or 
nucleic acid molecules of the present invention, such as its interaction with a binding molecule. The 
method of screening may involve high-throughput. 

For example, to screen for agonists or antagonists, the interaction partner of the nucleic acid molecule and 
nucleic acid, respectively, according to the present invention, maybe a synthetic reaction mix, a cellular 
compartment, such as a membrane, cell envelope or cell wall, or a preparation of any thereof, may be 
prepared from a cell that expresses a molecule that binds to the hyperimmune serum reactive antigens 
and fragments thereof of the present invention. The preparation is incubated with labelled hyperimmune 
serum reactive antigens and fragments thereof in the absence or the presence of a candidate molecule, 
which may be an agonist or antagonist. The ability of the candidate molecule to bind the binding 
molecule is reflected in decreased binding of the labelled ligand. Molecules which bind gratuitously, i. e., 
without inducing the functional effects of the hyperimmune serum reactive antigens and fragments 
thereof, are most likely to be good antagonists. Molecules that bind well and elicit functional effects that 
are the same as or closely related to the hyperimmune serum reactive antigens and fragments thereof are 
good agonists. 

The functional effects of potential agonists and antagonists may be measured, for instance, by 
determining the activity of a reporter system following interaction of the candidate molecule with a cell 
or appropriate cell preparation, and comparing the effect with that of the hyperimmune serum reactive 
antigens and fragments thereof of the present invention or molecules that elicit the same effects as the 
hyperimmune serum reactive antigens and fragments thereof. Reporter systems that may be useful in this 
regard include but are not limited to colorimetric labelled substrate converted into product, a reporter 
gene that is responsive to changes in the functional activity of the hyperimmune serum reactive antigens 
and fragments thereof, and binding assays known in the art. 
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Another example of an assay for antagonists is a competitive assay that combines the hyperimmune 
serum reactive antigens and fragments thereof of the present invention and a potential antagonist with 
membrane-bound binding molecules, recombinant binding molecules, natural substrates or ligands, or 
substrate or ligand mimetics, under appropriate conditions for a competitive inhibition assay. The 
hyperimmune serum reactive antigens and fragments thereof can be labelled such as by radioactivity or a 
colorimetric compound, such that the molecule number of hyperimmune serum reactive antigens and 
fragments thereof bound to a binding molecule or converted to product can be determined accurately to 
assess the effectiveness of the potential antagonist. 

Potential antagonists include small organic molecules, peptides, polypeptides and antibodies that bind to 
a hyperimmune serum reactive antigen and fragments thereof of the invention and thereby inhibit or 
extinguish its acitivity. Potential antagonists also may be small organic molecules, a peptide, a 
polypeptide such as a closely related protein or antibody that binds to the same sites on a binding 
molecule without inducing functional activity of the hyperimmune serum reactive antigens and 
fragments thereof of the invention. 

Potential antagonists include a small molecule, which binds to and occupies the binding site of the 
hyperimmune serum reactive antigens and fragments thereof thereby preventing binding to cellular 
binding molecules, such that normal biological activity is prevented. Examples of small molecules 
include but are not limited to small organic molecules, peptides or peptide-like molecules. 

Other potential antagonists include antisense molecules (see {Okano, H. et al., 1991}; 
OLIGODEOXYNUCLEOTIDES AS ANTISENSE INHIBITORS OF GENE EXPRESSION; CRC Press, Boca 
Ration, FL (1988), for a description of these molecules). 

Preferred potential antagonists include derivatives of the hyperimmune serum reactive antigens and 
fragments thereof of the invention. 

As used herein the activity of a hyperimmune serum reactive antigen and fragment thereof according to 
the present invention is its capability to bind to any of its interaction partner or the extent of such 
capability to bind to its or any interaction partner. 

In a particular aspect, the invention provides the use of the hyperimmune serum reactive antigens and 
fragments thereof, nucleic acid molecules or inhibitors of the invention to interfere with the initial 
physical interaction between a pathogen and mammalian host responsible for sequelae of infection. In 
particular the molecules of the invention may be used: i) in the prevention of adhesion of S. agalactiae to 
mammalian extracellular matrix proteins at mucosal surfaces and on in-dwelling devices or to 
extracellular matrix proteins in wounds; ii) to block bacterial adhesion between mammalian extracellular 
matrix proteins and bacterial proteins which mediate tissue damage or invasion iii) or lead to evasion of 
immune defense; iv) to block the normal progression of pathogenesis in infections initiated other than by 
the implantation of in-dwelling devices or by other surgical techniques, e.g. through inhibiting nutrient 
acquisition. 

Each of the DNA coding sequences provided herein may be used in the discovery and development of 
antibacterial compounds. The encoded protein upon expression can be used as a target for the screening 
of antibacterial drugs. Additionally, the DNA sequences encoding the amino terminal regions of the 
encoded protein or Shine-Delgarno or other translation facilitating sequences of the respective mRNA can 
be used to construct antisense sequences to control the expression of the coding sequence of interest. 

The antagonists and agonists may be employed, for instance, to inhibit diseases arising from infection 
with Streptococcus, especially S. agalactiae, such as sepsis. 
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In a still further aspect the present invention is related to an affinity device such affinity device comprises 
as least a support material and any of the hyperimmune serum reactive antigens and fragments thereof 
according to the present invention, which is attached to the support material. Because of the specificity of 
the hyperimmune serum reactive antigens and fragments thereof according to the present invention for 
their target cells or target molecules or their interaction partners, the hyperimmune serum reactive 
antigens and fragments thereof allow a selective removal of their interaction partner(s) from any kind of 
sample applied to the support material provided that the conditions for binding are met. The sample may 
be a biological or medical sample, including but not limited to, fermentation broth, cell debris, cell 
preparation, tissue preparation, organ preparation, blood, urine, lymph liquid, liquor and the like. 

The hyperimmune serum reactive antigens and fragments thereof may be attached to the matrix in a 
covalent or non-covalent manner. Suitable support material is known to the one skilled in the art and can 
be selected from the group comprising cellulose, silicon, glass, aluminium, paramagnetic beads, starch 
and dextrane. 

The present invention is further illustrated by the following figures, examples and the sequence listing, 
from which further features, embodiments and advantages may be taken. It is to be understood that the 
present examples are given by way of illustration only and not by way of limitation of the disclosure. 

In connection with the present invention 

Figure 1 shows the characterization of human antibody sources for S. agalactiae. 

Figure 2 shows the characterization of the small fragment genomic library, LSAg-70, from Streptococcus 
agalactiae ATCC 12403. 

Figure 3 shows the selection of bacterial cells by MACS using biotinylated human IgGs. 

Figure 4 shows the serotypes of the applied strains and an example for the gene distribution analysis with 
one of the identified antigens. 

Figure 5 shows examples for induction of epitope-specific antibodies in mice by immunization with E. coli 
lysates. 

Figure 6 shows examples for cell surface staining with epitope-specific antisera by flow cytometry. 

Figure 7 shows the determination of bactericidal activity of antibodies induced by selected epitopes in an 
in vitro assay. 

Table 1A shows the summary of all screens performed with genomic S. agalactiae libraries and human 
serum. Table IB shows antigenic proteins identified by sequence identity within antigenic regions of the 
proteins listed in Table 1A. 

Table 2 shows the summary of epitope serology analysis with human sera. 

Table 3 shows the summary of the gene distribution analysis for the identified antigens in 46 S. agalactiae 
strains. 

Table 4 shows the summary of mouse imunogenicity experiments. 

Table 5 shows the summary of all screens performed with genomic S. agalactiae libraries and human 
serum. 



WO 2004/099242 



PCT/EP2004/004856 



-49- 

Table 6 shows the summary of epitope serology analysis with human sera. 
Table 7 shows the summary of mouse imunogenicity experiments. 

The figures to which it might be referred to in the specification are described in the following in more 
details. 

Figure 1 shows the characterization of human sera and cervical secretions for anti-S. agalactiae antibodies 
as measured by immune assays. Total anti-S. agalactiae IgG and IgA antibody levels were measured by 
standard ELISA using total bacterial lysates or culture supernant fractions prepared from S. agalactiae 
serotype III strain ATCC 12403 as coating antigens. (A) Results of representative experiments are shown 
with healthy adult sera with total bacterial lysate proteins. Data are expressed as ELISA units calculated 
from absorbance at 405nm at a serum dilution in the linear range of detection (2.000X for IgA, 10,000 for 
IgG). Selected sera (out of 52) included in the healthy adult non-pregnant serum pool (NSag8-IgG,-IgA) 
are indicated by bold numbers. (B) Immunoblot analysis was performed on high titer sera selected by 
ELISA in order to ensure multiple immune reactivity with protein antigens. Results of a representative 
experiment using total bacterial lysate prepared from S. agalactiae serotype III ATCC 12403 strain and 
selected patients' sera at 5.000X dilution are shown. Blots were developed with anti-human IgG 
secondary antibody reagent. Low titer sera were included as negative controls. Mw: molecular weight 
markers. (C) shows selection of cervical secretions from noncolonized pregnant women by immunoblot 
analysis. Antibodies exctracted from cervical wicks were quantitated for IgA content. 2 |ag IgA from each 
prepaprations were tested for immunoreactivity using total bacterial lysate in a multi-well blotting 
apparatus. Blots were developed with anti-human IgA secondary antibodies. IgA preparation showing 
reactivity with GBS proteins (inidicated by arrows) were selected and pooled. 

Figure 2 (A) shows the fragment size distribution of the Streptococcus agalactiae ATCC 12403 small 
fragment genomic library, LSAg-70. After sequencing 576 randomly selected clones, sequences were 
trimmed (464) to eliminate vector residues and the numbers of clones with various genomic fragment 
sizes were plotted. (B) shows the graphic illustration of the distribution of the same set of randomly 
sequenced clones of LSAg-70 over the S. agalactiae ATCC 12403 chromosome. Rectangles indicate 
matching sequences to annotated ORFs and diamonds represent fully matched clones to non-coding 
chromosomal sequences in +/+ or +/- orientation. Circles position all clones with chimeric sequences. 
Numeric distances in base pairs are indicated over the circular genome for orientation. Partitioning of 
various clone sets within the library is given in numbers and percentage at the bottom of the figure. 

Figure 3 (A) shows the MACS selection with biotinylated human IgGs. The LSAg-70 library in pMAL9.1 
was screened with 15-20 |ag biotinylated IgG (PSagll-IgG, purified from human serum). As negative 
control, no serum was added to the library cells for screening. Number of cells selected after the 1 st , 2 nd 
and 3 rd elution are shown for each selection round (upper, middle and lower panel, respectively). (B) 
shows the reactivity of specific clones (1-26) selected by bacterial surface display as analysed by 
immunoblot analysis with the human serum IgG pool (PSagll-IgG, 4 |ag/|al) used for selection by MACS 
at a dilution of 1:3,000. As a loading control the same blot was also analysed with antibodies directed 
against the platform protein LamB at a dilution of 1:5,000 of hyperimmune rabbit serum. M, Molecular 
weigth marker; L, Extract from a clone expressing LamB without foreign peptide insert. 

Figure 4 (A) shows the representation of different serotypes of S. agalactiae clinical isolates analysed for 
the gene distribution study. A number of the strains were not typable and may represent additional 
serotypes. (B) shows the PCR analysis for the gene distribution of gbs0061 with the respective 
oligonucleotides and 46 S. agalactiae strains. The predicted size of the PCR fragments is 814 bp. 1-46, S. 
agalactiae strains, clinical isolates as shown under A; -, no genomic DNA added; +, genomic DNA from S. 
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agalactiae ATCC 12403, which served as template for library construction. 

Figure 5 shows the measurement of epitope-specific mouse serum IgG antibody levels induced by total 
bacterial lysates of Lamb or FhuA expressing E. coli clones with S. agalactiae-deiived epitopes. (A) shows 
representative peptide ELISA experiments with three sets of mouse sera (5 mice in each group, 1-5) 
generated by gbs0428, gbs0628 and gbs632 epitopes, respectively. Sera were tested at two different 
dilutions: black bars: 100X; grey bars; 1000X. Biotin-labeled synthetic peptides corresponding to the 
respective epitopes were used in the peptide ELISA. Sera induced with E. coli lysate without S. agalactiae 
derived epitopes are indicated as FhuA or LamB. (B) shows a typical immunoblotting experiment using 
lysates prepared from individual E. coli clones selected for mouse injections. Sera were depleted by E. coli 
lysate not carrying epitope to remove antibodies against E. coli proteins. Examples are shown for gbs0918, 
gbs0428, gbs0628 and gbs632 epitopes. Negative controls (-) are E. coli clones with empty platform 
proteins. Location of platform proteins LamB and FhuA is indicated by arrows. 

Figure 6 shows the detection of specific antibody binding on the cell surface of Streptococcus agalactiae by 
flow cytometry. In Figure 6A preimmune mouse sera and polyclonal sera raised against S. agalactiae 
serotype III lysate were incubated with S. agalactiae strain serotype III and analysed by flow cytometry. 
Control shows the level of non-specific binding of the secondary antibody to the surface of S. agalactiae 
cells. The histograms in figure 6B indicates the increased fluorescence due to specific binding of anti- 
gbs0031, anti-gbsl925 and anti-gbs0012 antibodies in comparison to the control sera generated against E. 
coli lysate containing only the 'empty 1 platform protein FhuA. 

Figure 7 shows the bactericidal activity of epitope specific antibodies as determined in in vitro killing 
assay. The killing activity of immune sera is measured parallel with and calculated relative to the 
appropriate control sera. Data are expressed as percentage of killing, that is the reduction on bacterial cfu 
numbers as a consequence of the presence of specific antibodies. Hyperimmune polyclonal mouse sera 
generated with S. agalactiae lysate and sera from non-immunized mice served as positive and negative 
controls for the assay, respectively. Immune sera generated with (A) gbs0012, gbs0016, gbs0031, gbs0428, 
gbsl306 and gbs2018 epitopes and with (B) gbs0233, gbs0419, gbs0942, gbs0975, gbsl038, gbsll44 and 
gbs2093 epitopes were tested for bactericidal activity and data are expressed relative to appropriate 
controls, such as sera induced with Lamb or FhuA expressing E. coli clones without S. agalactiae-derived 
epitopes. S. agalactiae serotype III cells were incubated with mouse phagocytic cells for 60 min, and 
surviving bacteria were quantified by counting cfus after plating on blood agar. 

Table 1: Immunogenic proteins identified by bacterial surface display. 

(A) Columns A, 300bp library of S.agalactiae ATCC 12403 in fhuA with NSag8-IgA (826), B, 300bp library 
in fhuA with PSaglO-IgA (768), C, 300bp library in fhuA with PSaglO-IgG (711), D, 300bp library in fhuA 
with PSagll-IgG (640), E, 70bp library in lamB with NSag8-IgA (1057), F, 70bp library in lamB with 
NSag8-IgG (869), G, 70bp library in lamB with PSaglO-IgA (904), H, 70bp library in lamB with PSaglO- 
IgA-adsorbed (493), I, 70bp library in lamB with PSaglO-IgG (910), J, 70bp library in lamB with PSagll- 
IgA (631), K, 70bp library in lamB with PSagll-IgG (926), L, 70bp library in lamB with PSagl8-IgA (691), 
M, 70bp library in lamB with PSag-sIgA (628); * prediction of antigenic sequences longer than 5 amino 
acids was performed with the program ANTIGENIC (Kolaskar and Tongaonkar, 1990). Table IB lists the 
immunogenic proteins identified by amino acid sequence identity with peptides identified by bacterial 
surface display. Antigenic peptides, which have been identified by bacterial surface-display possess 
identical counterparts in the listed proteins from S. agalactiae. The peptides have been shown to react with 
multiple human sera (see Table 2). Sera directed against these peptides can therefore recognize multiple 
proteins. 

Table 2: Epitope serology with human sera. 

Immune reactivity of individual synthetic peptides representing selected epitopes with human sera is 
shown. Extent of reactivity is expressed as +, ++ or +++, and summed from individual reactivities of 
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peptides with individual sera (13 patient and 9 healthy adult, 22 total). A total score for each peptide was 
calculated based on ELISA units as the sum of all reactivities. Scores were 2-8 for +, 9-16 for ++ and 17-26 
for +++ m ELISA units were calculated from OD 4 o5nm readings and the serum dilution after correction for 
background. Location of synthetic peptides within the antigenic ORFs according to the genome 
annotation of ATCC 12403 strain is given in columns aa from and aa to indicating the first and last amino 
acid residues, respectively. Peptide names: gbs0012.1 present in annotated ORF: gbs0012. 

Table 3: Gene distribution in S. agalactiae strains. 

Fourty six S. agalactiae strains as shown in Figure 4A were tested by PGR with oligonucleotides specific 
for the genes encoding relevant antigens. The PGR fragment of one selected PGR reaction was sequenced 
in order to confirm the amplification of the correct DNA fragment. * number of amino acid substitutions 
in a serotype IA strain as derived from sequencing as compared to S. agalactiae ATCC 12403. #, alternative 
strain used for sequencing, because gene was not present in the serotype IA strain. 

Table 4: Immunogenicity of antigenic epitopes. 

S. agalactiae antigens were tested for immunogenicity by immunization with E. coli clones harboring 
plasmids encoding the platform proteins LamB or FhuA fused to S. agalactiae peptides. The presence of 
epitope-specific antibodies were detected and measured by peptide ELISA and/or immunoblotting using 
the corresponding E. coli clone lysate, which served as immunogen. Results are expressed as + to +++++, 
and calculated for peptide ELISA as the sum of the reactivity of individual mouse sera based on ELISA 
units (as indicated on Fig. 5A) and for immunoblotting (IB) as the strength of reactivity of pooled (5 
individual) mouse sera with the epitope containing platform protein (as indicated on Fig. 5B). Location of 
synthetic peptides within the antigenic ORFs according to the genome annotation of ATCC 12403 strain is 
given in columns aa from and aa to indicating the first and last amino acid residues, respectively. 

Table 5: Immunogenic proteins identified by bacterial surface display. 

(A) 300bp library of S.agalactiae ATCC 12403 in fhuA with IC8-IgA (826), B, 300bp library in fhuA with 
PlO-IgA (768), C, 300bp library in fhuA with PlO-IgG (711), D, 300bp library in fhuA with Pll-IgG (640), 
E, 70bp library in lamB with IC8-IgA (1057), F, 70bp library in lamB with IC8-IgG (869), G, 70bp library in 
lamB with PlO-IgA (904), H, 70bp library in lamB with PlO-IgA-adsorbed (493), I, 70bp library in lamB 
with PlO-IgG (910), J, 70bp library in lamB with Pll-IgA (631), K, 70bp library in lamB with Pll-IgG 
(926), * prediction of antigenic sequences longer than 5 amino acids was performed with the program 
ANTIGENIC (Kolaskar and Tongaonkar, 1990). 

Table 6: Epitope serology with human sera. 

Immune reactivity of individual synthetic peptides representing selected epitopes with human sera is 
shown. Extent of reactivity is expressed as +, ++ or +++, and summed from individual reactivities of 
peptides with individual sera (13 patient and 9 healthy adult, 22 total). A total score for each peptide was 
calculated based on ELISA units as the sum of all reactivities. Scores were 2-8 for +, 9-16 for ++ and 17-30 
for +++. ELISA units were calculated from OD 4 o5nm readings and the serum dilution after correction for 
background. Location of synthetic peptides within the antigenic ORFs according to the genome 
annotation of ATCC 12403 strain is given in columns aa from and aa to indicating the first and last amino 
acid residues, respectively. Peptide names: gbs0233.1 present in annotated ORF: gbs0233. 

Table 7: Immunogenicity of antigenic epitopes in mice. 

S. agalactiae antigens were tested for immunogenicity by immunization with E. coli clones harboring 
plasmids encoding the platform proteins LamB or FhuA fused to S. agalactiae peptides. The presence of 
epitope-specific antibodies were detected and measured by peptide ELISA. Results are expressed as + to 
Mm , and calculated for peptide ELISA as the sum of the reactivity of individual mouse sera based on 
ELISA units (as indicated on Fig. 5). Location of epitopes within the antigenic ORFs according to the 
genome annotation of ATCC 12403 strain is given in columns aa from and aa to indicating the first and 
last amino acid residues, respectively. 
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EXAMPLES 

Example 1: Characterization and selection of human serum sources based on anti-S. agalactiae 
antibodies, preparation of antibody screening reagents 

Experimental procedures 

Enzyme linked immune assay (ELISA). 

ELISA plates (Maxisorb, Millipore) were coated with 5-10 p-g/ml total protein diluted in coating buffer 
(0.1M sodium carbonate pH 9.2). Three dilutions of sera (2,000X, 10,000X, 50,000X) were made in PBS- 
BSA. Highly specific Horse Radish Peroxidase (HRP)-conjugated anti-human IgG or anti-human IgA 
secondary antibodies (Southern Biotech) were used according to the manufacturers' recommendations 
(dilution: l,000x). Antigen-antibody complexes were quantified by measuring the conversion of the 
substrate (ABTS) to colored product based on OD405nm readings by automatic ELIAS reader (TECAN 
SUNRISE). 

Preparation of bacterial antigen extracts 

Total bacterial lysate: Bacteria were grown overnight in THB (Todd-Hewitt Broth) and lysed by repeated 
freeze-thaw cycles: incubation on dry ice/ethanol-mixture until frozen (1 min), then thawed at 37°C (5 
min): repeated 3 times. This was followed by sonication and collection of supernatant by centrifugation 
(3,500 rpm, 15 min, 4°C). 

Culture supernatant: After removal of bacteria by centrifugation, the supernatant of overnight grown 
bacterial cultures was precipitated with ice-cold ethanol by mixing 1 part supernatant with 3 parts 
absolute ethanol and incubated overnight at -20°C. Precipitates were collected by centrifugation (2,600 g, 
for 15 min). Dry pellets were dissolved either in PBS for ELISA, or in urea and SDS-sample buffer for 
SDS-PAGE and immunoblotting. The protein concentration of samples was determined by Bradford 
assay. 

Immunoblotting 

Total bacterial lysate and culture supernatant samples were prepared from in vitro grown S. agalactiae 
serotype III strain. 10 to 25|j.g total protein/lane was separated by SDS-PAGE using the BioRad Mini- 
Protean Cell electrophoresis system and proteins transferred to nitrocellulose membrane (ECL, 
Amersham Pharmacia). After overnight blocking in 5% milk, human sera were added at 2,000x dilution, 
and HRPO labeled anti-human IgG was used for detection. 

Extraction of antibodies from cervical wicks 

Cervical secretions were collected by absorbent cylindrical wicks (Polyfiltronics) which were introduced 
into the cervical canal during speculum examination and thereafter kept frozen until extraction. 
Extraction was done according to Hordnes et al, 1998 (provider of the samples). Briefly, wicks were mixed 
with PBS containing protease inhibitors, vortexed and fluid was drained from the tubes containing the 
wicks. The concentrations of total IgA and IgG antibodies in extracts were determined. 

Purification of antibodies for genomic screening. Five sera from both the patient and the healthy group were 
selected based on the overall anti-GBS titers for serum or cervical secretion pools used in the screening 
procedure. Antibodies against E. coli proteins were removed by incubating the heat-inactivated sera with 
whole cell £. coli cells (DHSalpha, transformed with pHIEll, grown under the same condition as used for 
bacterial surface display). Highly enriched preparations of IgGs from the pooled, depleted sera were 
generated by protein G affinity chromatography, according to the manufacturer's instructions (UltraLink 
Immobilized Protein G, Pierce). IgA antibodies were purified also by affinity chromatography using 
biotin-labeled anti-human IgA (Southern Biotech) immobilized on Streptavidin-agarose (GIBCO BRL). 
The efficiency of depletion and purification was checked by SDS-PAGE, Western blotting, ELISA and 
protein concentration measurements. 
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Results 

The antibodies produced against S. agalactiae by the human immune system and present in human sera 
are indicative of the in vivo expression of the antigenic proteins and their immunogenicity. These 
molecules are essential for the identification of individual antigens in the approach as described in the 
present invention, which is based on the interaction of the specific anti-GBS antibodies and the 
corresponding S. agalactiae peptides or proteins. To gain access to relevant antibody repertoires, human 
sera were collected from 

I. healthy pregnant women tested negative for cervical and anorectal carriage of GBS 

II. healthy pregnant women tested positive for cervical and/or anorectal carriage of GBS who's 
newborn remained GBS-free (although with antibiotic prevention). 

III. adults below <45 years of age without clinical disease. 

IV. naive individuals, youg children between 5 and 10 months of age, after they already lost 
maternal antibodies and have not acquired GBS-specific ones due to the lack of GBS disease. 

In addition cervical secretions were also collected from the first two groups of donors. The extrem value 
of these antibody sources is mainly the secretory IgA component, which is directly implicated as 
protective effector moluecule on mucosal surfaces. 

It is important to screen with antibodies from at least two different populations, pregnant women and 
nonpregnant adults, since GBS disease affects elderly and immunocompromised adults, as well. Within 
the pregnant study group, there are again two different patient categories, women who are GBS 
colonized and those who are noncolonized, to be included in the antigen screen. 

Antibodies in serum and other body fluids, such as mucosal secretions induced in individuals exposed to 
the pathogens are crucial for antigen identification. The exposure to GBS results in asymptomatic 
colonization, current or past acute or chronic infection. S. agalactiae colonization and infections are 
common, and antibodies are present as a consequence of natural immunization from previous 
encounters. It is likely that sera from high titer noncolonized individuals contain functional antibodies, 
which are able to eliminate carriage. At the same time certain antibodies might be induced against GBS 
components only if the antigen persist. For that reason sera from colonized individuals were also 
included. It has been shown that colonization is associated with capsular polysaccahride (CPS)-specific 
antibody responses. However, it is not clear whether sufficient level of antibodies to CPS's would prevent 
GBS colonization, since there are colonized women with both high and low levels of anti-CPS antibody, 
and the same is true for noncolonized pregnant women. 

However, there are reports that effector function and avidity of antibodies produced during pregnancy 
might be altered. It is important to recognize that most healthy adults are protected from invasive GBS 
disease and are less susceptible than newborns and the elderly. Antibodies from these individuals seem 
to be especially valuable for identification of the corresponding antigens. It is known that anti-GBS 
antibody levels increase with age. 

GBS is a mucosal pathogen and should induce IgA response; for that reason it was important to perform 
IgA-based screens, as well as IgG-based screens. The fact that some S. agalactiae strains express high 
affinity IgA-binding receptor also points to the importance of IgA in host response. Recently it was 
reported that not only IgG, but also IgA serum antibodies can be recognized by the FcRIII receptors of 
PMNs and promote opsonization {Phillips-Quagliata, J. et al., 2000}; {Shibuya, A. et al., 2000}. The 
primary role of IgA antibodies is neutralization, mainly at the mucosal surface. The level of serum IgA 
reflects the quality, quantity and specificity of the dimeric secretory IgA. For that reason the serum 
collection was not only analyzed for anti-streptococcal IgG, but also for IgA levels. In the ELISA assays 
highly specific secondary reagents were used to detect antibodies from the high affinity types, such as 
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IgG and IgA, but avoided IgM. Production of IgM antibodies occurs during the primary adaptive 
humoral response, and results in low affinity antibodies, while IgG and IgA antibodies had already 
undergone affinity maturation, and are more valuable in fighting or preventing disease. 

127 serum samples and 97 cervical secretions from pregnant women and 50 sera from healthy adults 
were characterized for anti-S. agalactiae antibodies by a series of immune assays. Primary characterization 
was done by ELISA using two different antigen preparations, such as total bacterial extract and culture 
supernatant proteins prepared from S. agalactiae serotype III ATCC 12403 strain. A representative 
experiment is shown in Fig. 1A using sera from the healthy adult population. Antibody titers were 
compared at given dilutions where the response was linear. Sera were ranked based on the IgG and IgA 
reactivity against the two complex antigenic mixtures (including serotype specific type III capsule), and 
the highest ones were selected for further testing by immunoblotting. This analysis confirmed a high 
antibody reactivity of the pre-selected sera against multiple GBS proteins, especially when compared to 
not selected, low-titer sera (Fig IB). However, ELISA ranking of sera did not always correlated with 
immunoblot signals suggesting that anti-capsular antibodies were abundant and dominated the ELISA 
reactivities against total bacterial extracts. Thus the final selection of sera to be included in antibody-pools 
was based mainly on multiple immunogenic bands in immunoblotting experiments. This extensive 
antibody characterization approach has led to the unambiguous identification of anti-GBS hyperimmune 
sera. 

The 97 cervical secretions were determined for IgA content, and same amount (2\xg) was tested for anti- 
GBS reactivity by immunoblotting. Positively selected sera (as it is shown in Fig. 1C) were devided into 
colonized and noncolonized IgA pools and used separately in bacterial surface display experiments. 

5 sera from both donor groups were selected and pooled for antigen identification by bacterial surface 
display. Selected sera included in the four pregnant women pools (PSAglO-IgG,~IgA, PSAgll-IgG, 
PSAgl8-IgG and PSAg-sIgA) and one healthy adult (non-pregnant) pool (NSAg8-IgG,-IgA). IgG and IgA 
antibodies were purified from pooled sera by affinity chromatography and depleted of E. coli -reactive 
antibodies to avoid background in the bacterial surface display screen. 

Example 2: Generation of highly random, frame-selected, small-fragment, genomic DNA libraries of 
Streptococcus agalactiae 

Experimental procedures 

Preparation of streptococcal genomic DNA. 50 ml Todd-Hewitt Broth medium was inoculated with S. 
agalactiae ATCC 12403 bacteria from a frozen stab and grown with aeration and shaking for 18 h at 37°C. 
The culture was then harvested, centrifuged with l,600x g for 15 min and the supernatant was removed. 
Bacterial pellets were washed 3 x with PBS and carefully re-suspended in 0.5 ml of Lysozyme solution 
(100 mg/ml). 0.1 ml of 10 mg/ml heat treated RNase A and 20 U of RNase Tl were added, mixed carefully 
and the solution was incubated for 1 h at 37°C. Following the addition of 0.2 ml of 20 % SDS solution and 
0.1 ml of Proteinase K (10 mg/ml) the tube was incubated overnight at 55°C. 1/3 volume of saturated 
NaCl was then added and the solution was incubated for 20 min at 4°C. The extract was pelleted in a 
microfuge (13,000 rpm) and the supernatant transferred into a new tube. The solution was extracted with 
PhOH/CHCb/IAA (25:24:1) and with CHCb/IAA (24:1). DNA was precipitated at room temperature by 
adding 0.6x volume of Isopropanol, spooled from the solution with a sterile Pasteur pipette and 
transferred into tubes containing 80% ice-cold ethanol. DNA was recovered by centrifuging the 
precipitates with 10-12,000x g, then dried on air and dissolved in ddH^O. 

Preparation of small genomic DNA fragments. Genomic DNA fragments were mechanically sheared into 
fragments ranging in size between 150 and 300 bp using a cup-horn sonicator (Bandelin Sonoplus UV 
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2200 sonicator equipped with a BB5 cup horn, 10 sec. pulses at 100 % power output) or into fragments of 
size between 50 and 70 bp by mild DNase I treatment (Novagen). It was observed that sonication yielded 
a much tighter fragment size distribution when breaking the DNA into fragments of the 150-300 bp size 
range. However, despite extensive exposure of the DNA to ultrasonic wave-induced hydromechanical 
shearing force, subsequent decrease in fragment size could not be efficiently and reproducibly achieved. 
Therefore, fragments of 50 to 70 bp in size were obtained by mild DNase I treatment using Novagen's 
shotgun cleavage kit. A 1:20 dilution of DNase I provided with the kit was prepared and the digestion 
was performed in the presence of MnCk in a 60 ul volume at 20°C for 5 min to ensure double-stranded 
cleavage by the enzyme. Reactions were stopped with 2 ul of 0.5 M EDTA and the fragmentation 
efficiency was evaluated on a 2% TAE-agarose gel. This treatment resulted in total fragmentation of 
genomic DNA into near 50-70 bp fragments. Fragments were then blunt-ended twice using T4 DNA 
Polymerase in the presence of 100 jaM each of dNTPs to ensure efficient flushing of the ends. Fragments 
were used immediately in ligation reactions or frozen at -20°C for subsequent use. 

Description of the vectors. The vector pMAL4.31 was constructed on a pASK-IBA backbone {Skerra, A., 
1994} with the beta-lactamase (bla) gene exchanged with the Kanamycin resistance gene. In addition the 
bla gene was cloned into the multiple cloning site. The sequence encoding mature beta-lactamase is 
preceded by the leader peptide sequence of ompA to allow efficient secretion across the cytoplasmic 
membrane. Furthermore a sequence encoding the first 12 amino acids (spacer sequence) of mature beta- 
lactamase follows the ompA leader peptide sequence to avoid fusion of sequences immediately after the 
leader peptidase cleavage site, since e.g. clusters of positive charged amino acids in this region would 
decrease or abolish translocation across the cytoplasmic membrane {Kajava, A. et al., 2000}. A Smal 
restriction site serves for library insertion. An upstream Fsel site and a downstream NotI site, which were 
used for recovery of the selected fragment, flank the Smal site. The three restriction sites are inserted after 
the sequence encoding the 12 amino acid spacer sequence in such a way that the bla gene is transcribed in 
the -1 reading frame resulting in a stop codon 15 bp after the Notl site. A +1 bp insertion restores the bla 
ORF so that beta-lactamase protein is produced with a consequent gain of Ampicillin resistance. 

The vector pMAL9.1 was constructed by cloning the lamB gene into the multiple cloning site of pEHl 
{Hashemzadeh-Bonehi, L. et al., 1998}. Subsequently, a sequence was inserted in lamB after amino acid 
154, containing the restriction sites Fsel, Smal and Notl. The reading frame for this insertion was 
constructed in such a way that transfer of frame-selected DNA fragments excised by digestion with Fsel 
and Notl from plasmid pMAL4.31 yields a continuous reading frame of lamB and the respective insert. 

The vector pHIEll was constructed by cloning the fhuA gene into the multiple cloning site of pEHl. 
Thereafter, a sequence was inserted in fhu A after amino acid 405, containing the restriction site Fsel, Xbal 
and NotL The reading frame for this insertion was chosen in a way that transfer of frame-selected DNA 
fragments excised by digestion with Fsel and Notl from plasmid pMAL4.31 yields a continuous reading 
frame of fhuA and the respective insert. 

Cloning and evaluation of the library for frame selection. Genomic S. agalactiae DNA fragments were ligated 
into the Smal site of the vector pMAL4.31. Recombinant DNA was electroporated into DH10B 
electrocompetent E. coli cells (GIBCO BRL) and transformants plated on LB-agar supplemented with 
Kanamycin (50 ug/ml) and Ampicillin (50 ug/ml). Plates were incubated over night at 37°C and colonies 
collected for large scale DNA extraction. A representative plate was stored and saved for collecting 
colonies for colony PCR analysis and large-scale sequencing. A simple colony PCR assay was used to 
initially determine the rough fragment size distribution as well as insertion efficiency. From sequencing 
data the precise fragment size was evaluated, junction intactness at the insertion site as well as the frame 
selection accuracy (3n+l rule). 

Cloning and evaluation of the library for bacterial surface display. Genomic DNA fragments were excised from 
the pMAL4.31 vector, containing the S. agalactiae library with the restriction enzymes Fsel and NotL The 
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entire population of fragments was then transferred into plasmids pMAL9.1 (LamB) or pHIEll (FhuA), 
which have been digested with Fsel and NotL Using these two restriction enzymes, which recognise an 8 
bp GC rich sequence, the reading frame that was selected in the pMAL4.31 vector is maintained in each of 
the platform vectors. The plasmid library was then transformed into E. coli DHSalpha cells by 
electroporation. Cells were plated onto large LB-agar plates supplemented with 50 ug/ml Kanamycin and 
grown over night at 37°C at a density yielding clearly visible single colonies. Cells were then scraped off 
the surface of these plates, washed with fresh LB medium and stored in aliquots for library screening at - 
80°C. 

Results 

Libraries for frame selection. Two libraries (LSAg-70 and LSAg-300) were generated in the pMAL4.31 
vector with sizes of approximately 70 and 300 bp, respectively. For each library, ligation and subsequent 
transformation of approximately 1 ug of pMAL4.31 plasmid DNA and 50 ng of fragmented genomic S. 
agalactiae DNA yielded 4x 10 5 to 2x 10 6 clones after frame selection. To assess the randomness of the 
libraries, approximately 576 randomly chosen clones of LSAg-70 were sequenced. The bioinformatic 
analysis showed that of these clones only very few were present more than once. Furthermore, it was 
shown that approximately 80% of the clones fell in the size range between 25 and 100 bp with an average 
size of approximately 40 bp (Figure 2). Allmost all sequences followed the 3n+l rule, showing that all 
clones were properly frame selected. 

Bacterial surface display libraries. The display of peptides on the surface of E. coli required the transfer of the 
inserts from the LSAg-70 and LSAg-300 libraries from the frame selection vector pMAL4.31 to the display 
plasmids pMAL9.1 (LamB) or pHIEll (FhuA). Genomic DNA fragments were excised by Fsel and Notl 
restriction and ligation of 5ng inserts with 0.1 ug plasmid DNA and subsequent transformation into 
DHSalpha cells resulted in 2-5x 10 6 clones. The clones were scraped off the LB plates and frozen without 
further amplification. 

Example 3: Identification of highly immunogenic peptide sequences from S. agalactiae using bacterial 
surface displayed genomic libraries and human serum 

Experimental procedures 

MACS screening. Approximately 2.5x 10 8 cells from a given library were grown in 5 ml LB-medium 
supplemented with 50 ug/ml Kanamycin for 2 h at 37°C. Expression was induced by the addition of 1 
mM IPTG for 30 min. Cells were washed twice with fresh LB medium and approximately 2x 10 7 cells re- 
suspended in 100 ul LB medium and transferred to an Eppendorf tube. 

10 to 20 ug of biotinylated, human IgGs purified from serum was added to the cells and the suspension 
incubated overnight at 4°C with gentle shaking. 900 ul of LB medium was added, the suspension mixed 
and subsequently centrifuged for 10 min at 6,000 rpm at 4°C (For IgA screens, 10 ug of purified IgAs 
were used and these captured with biotinylated anti-human-IgG secondary antibodies). Cells were 
washed once with 1 ml LB and then re-suspended in 100 ul LB medium. 10 ul of MACS microbeads 
coupled to streptavidin (Miltenyi Biotech, Germany) were added and the incubation continued for 20 min 
at 4°C. Thereafter 900 ul of LB medium was added and the MACS microbead cell suspension was loaded 
onto the equilibrated MS column (Miltenyi Biotech, Germany) which was fixed to the magnet. (The MS 
columns were equilibrated by washing once with 1 ml 70% EtOH and twice with 2 ml LB medium.) 

The column was then washed three times with 3 ml LB medium. After removal of the magnet, cells were 
eluted by washing with 2 ml LB medium. After washing the column with 3 ml LB medium, the 2 ml 
eluate was loaded a second time on the same column and the washing and elution process repeated. The 
loading, washing and elution process was performed a third time, resulting in a final eluate of 2 ml. 
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A second and third round of screening was performed as follows. The cells from the final eluate were 
collected by centrifugation and re-suspended in 1 ml LB medium supplemented with 50 ug/ml 
Kanamycin. The culture was incubated at 37°C for 90 min and then induced with 1 mM IPTG for 30 min. 
Cells were subsequently collected, washed once with 1 ml LB medium and suspended in 10 ul LB 
medium. 10 to 20 ug of human, biotinylated XgGs were added again and the suspension incubated over 
night at 4°C with gentle shaking. All further steps were exactly the same as in the first selection round. 
Cells selected after two rounds of selection were plated onto LB-agar plates supplemented with 50 ug/ml 
Kanamycin and grown over night at 37°C. 

Evaluation of selected clones by sequencing and Western blot analysis. Selected clones were grown overnight at 
37°C in 3 ml LB medium supplemented with 50 ug/ml Kanamycin to prepare plasmid DNA using 
standard procedures. Sequencing was performed at MWG (Germany) or in collaboration with TIGR 
(U.S.A.). 

For Western blot analysis approximately 10 to 20 ug of total cellular protein was separated by 10% SDS- 
PAGE and blotted onto HybondC membrane (Amersham Pharmacia Biotech, England). The LamB or 
FhuA fusion proteins were detected using human serum as the primary antibody at a dilution of 
approximately 1:3,000 to 1:5,000 and anti-human IgG or IgA antibodies coupled to HRP at a dilution of 
1:5,000 as secondary antibodies. Detection was performed using the ECL detection kit (Amersham 
Pharmacia Biotech, England). Alternatively, rabbit anti-FhuA or rabbit anti-LamB polyclonal immune 
sera were used as primary antibodies in combination with the respective secondary antibodies coupled to 
HRP for the detection of the fusion proteins. 

Results 

Screening of bacterial surface display libraries by magnetic activated cell sorting (MACS) using biotinylated Igs. 
The libraries LSag-70 in pMAL9.1 and LSag-300 in pHIEll were screened with pools of biotinylated, 
human IgGs and IgAs prepared from sera of healthy adults (NSag8-IgG,-IgA) or P10,ll,18 (see Example 
1: Preparation of antibodies from human serum). The selection procedure was performed as described under 
Experimental procedures. Figure 3A shows a representative example of a screen with the LSag-70 library 
and PSagll-IgGs. As can be seen from the colony count after the first selection cycle from MACS 
screening, the total number of cells recovered at the end is drastically reduced from 2xl0 7 cells to 
approximately 2x 10 4 cells, but the selection without antibodies added showed a similar reduction in cell 
numbers (Figure 3A). Therefore a second and third round of selection was performed. At the end of 
round three, approximately 10 4 cells was recovered with PSagll-IgGs, while only 2x 10 3 cells were 
recovered when no IgGs from human serum were added, clearly showing that selection was dependent 
on S. agalactiae specific antibodies. To evaluate the performance of the screen, 26 selected clones were 
picked randomly and subjected to immunoblot analysis with screening IgG pool (Figure 3B). This 
analysis revealed that more than 80% of the selected clones showed reactivity with antibodies present in 
the relevant serum whereas the control strain expressing LamB without a S. agalactiae specific insert did 
not react with the same serum. In general, the rate of reactivity was observed to lie within the range of 35 
to 90%. Colony PCR analysis showed that all selected clones contained an insert in the expected size 
range. 

Subsequent sequencing of a larger number of randomly picked clones (600 to 1200 per screen) led to the 
identification of the gene and the corresponding peptide or protein sequence that was specifically 
recognized by the human serum antibodies used for screening. The frequency with which a specific clone 
is selected reflects at least in part the abundance and/or affinity of the specific antibodies in the serum 
used for selection and recognizing the epitope presented by this clone. In that regard it is striking that 
clones derived from some ORFs (e.g. gbsl087, gbsl306, gbs2018) were picked more than 100 times, 
indicating their highly immunogenic property. Table 1 and Table 5 summarizes the data obtained for all 
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13 performed screens. All clones that are presented in Table 1 and Table 5 have been verified by 
irnmunoblot analysis using whole cellular extracts from single clones to show the indicated reactivity 
with the pool of human serum used in the respective screen. As can be seen from Table 1 and Table 5, 
distinct regions of the identified ORF are identified as immunogenic, since variably sized fragments of 
the proteins are displayed on the surface by the platform proteins. 

It is further worth noticing that most of the genes identified by the bacterial surface display screen encode 
proteins that are either attached to the surface of S. agalactiae and/or are secreted. This is in accordance 
with the expected role of surface attached or secreted proteins in virulence of S. agalactiae. 

Example 4: Assessment of the reactivity of highly immunogenic peptide sequences with individual 
human sera. 

Experimental procedures 

Peptide synthesis 

Peptides were synthesized in small scale (4 mg resin; up to 288 in parallel) using standard F-moc 
chemistry on a Rink amide resin (PepChem, Tubingen, Germany) using a SyroII synthesizer 
(Multisyntech, Witten, Germany). After the sequence was assembled, peptides were elongated with 
Fmoc-epsilon-aminohexanoic acid (as a linker) and biotin (Sigma, St. Louis, MO; activated like a normal 
amino acid). Peptides were cleaved off the resin with 93%TFA, 5% triethylsilane, and 2% water for one 
hour. Peptides were dried under vacuum and freeze dried three times from acetonitrile/water (1:1). Hie 
presence of the correct mass was verified by mass spectrometry on a Reflex III MALDI-TOF (Bruker, 
Bremen Germany). The peptides were used without further purification. 

Enzyme linked immune assay (ELISA). 

Biotin-labeled peptides (at the N-terminus) were coated on Streptavidin ELISA plates (EXICON) at 10 
jug/ml concentration according to the manufacturer's instructions. Highly specific Horse Radish 
Peroxidase (HRP)-conjugated anti-human IgG secondary antibodies (Southern Biotech) were used 
according to the manufacturers' recommendations (dilution: l,000x). Sera were tested at two serum 
dilutions, 200X and 1,000X. Following manual coating, peptide plates were processed and analyzed by 
the Gemini 160 ELISA robot (TECAN) with a built-in ELISA reader (GENIOS, TEC AN). 

Results 

Following the bioinformatic analysis of selected clones, corresponding peptides were designed and 
synthesized. In case of epitopes with more than 26 amino acid residues, overlapping peptides were made. 
All peptides were synthesized with a N-terminal biotin-tag and used as coating reagents on Streptavidin- 
coated ELISA plates. 

The analysis was performed with peptides that were selected based on their reactivity with the individual 
sera, which were included in the serum pools used for preparations of IgG and IgA screening reagents for 
bacterial surface display. A summary for serum reactivity of 74 peptides representing 55 different S. 
agalactiae antigenic proteins from the genomic screen analysed with 22 human sera (from 13 patient and 9 
healthy exposed high titer individuals) used for the antigen identification is shown in Table 2 and Table 
6. The peptides were compared by the score calculated for each peptide based on the number of positive 
sera and the extent of reactivity. Peptides range from highly and widely reactive to weakly positive ones. 
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Example 5: Gene distribution studies with highly immunogenic proteins identified from S. agalactiae. 
Experimental procedures 

Gene distribution ofGBS antigens by PCR. An ideal vaccine antigen would be an antigen that is present in 
all, or the vast majority of strains of the target organism to which the vaccine is directed. In order to 
establish whether the genes encoding the identified Streptococcus agalactiae antigens occur ubiquitously in 
S. agalactiae strains, PCR was performed on a series of independent S. agalactiae isolates with primers 
specific for the gene of interest. S. agalactiae isolates were obtained covering the serotypes most frequently 
present in patients as shown in Figure 4A. Oligonucleotide sequences as primers were designed for all 
identified ORFs yielding products of approximately 1,000 bp, if possible covering all identified 
immunogenic epitopes. Genomic DNA of all S. agalactiae strains was prepared as described under 
Example 2. PCR was performed in a reaction volume of 25 \A using Taq polymerase (1U), 200 nM dNTPs, 
10 pMol of each oligonucleotide and the kit according to the manufacturers instructions (Invitrogen, The 
Netherlands). As standard, 30 cycles (lx: 5min. 95°C, 30x: 30sec. 95°C, 30sec. 56°C, 30sec. 72°C, lx 4min. 
72°C) were performed, unless conditions had to be adapted for individual primer pairs. 

Results 

Identified genes encoding immunogenic proteins were tested by PCR for their presence in 46 different 
strains of S. agalactiae (Figure 4A). As an example, figure 4B shows the PCR reaction for GBS0061 with all 
indicated 46 strains. As clearly visible, the gene is present in all strains analysed. The PCR fragment from 
a type IA strain was sequenced and showed that all 657 bp were identical as compared to the S. agalactiae 
ATCC 12403 strain, indicating a high level of conservation between the two isolates. 

From a total of 117 genes analysed, more than 100 were present in all or almost all strains tested, while 
only 5 genes were absent in more than 10% of the tested 46 strains (Table 3). In addition, only few genes 
(e.g. GBS0016, GBS1087, GBS1528 and GBS2018) showed variation in size but were present in all or most 
strain isolates. Sequencing of the generated PCR fragment from one strain and subsequent comparison to 
the type III strain ATCC 12403 confirmed the amplification of the correct DNA fragment and revealed a 
degree of sequence divergence as indicated in Table 3. Importantly, many of the identified antigens are 
well conserved in all strains in sequence and size and are therefore novel vaccine candidates to prevent 
infections by GBS. 

Example 6: Characterization of immune sera obtained from mice immunized with highly 
immunogenic proteins/peptides from S. agalactiae displayed on the surface of E. coli. 

Experimental procedures 

Generation of immune sera from mice 

E. coli clones harboring plasmids encoding the platform protein fused to a S. agalactiae peptide, were 
grown in LB medium supplemented with 50fig/ml Kanamycin at 37°C. Overnight cultures were diluted 
1:10, grown until an ODeoo of 0.5 and induced with 0.2 mM IPTG for 2 hours. Pelleted bacterial cells were 
suspended in PBS buffer and disrupted by sonication on ice, generating a crude cell extract. According to 
the OD600 measurement, an aliquot corresponding to 5xl0 7 cells was injected into NMRI mice i.v., 
followed by a boost after 2 weeks. Serum was taken 1 week after the second injection. Epitope specific 
antibody levels were measured by peptide ELISA. 

In vitro expression of antigens 

Expression of antigens by in vitro grown S. agalactiae serotype III was tested by immunoblotting. Different 
growth media and culture conditions were tested to detect the presence of antigens in total lysates and 
bacterial culture supernatants. Expression was considered confirmed when a specific band corresponding 
to the predicted molecular weight and electrophoretic mobility was detected. 
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Cell surface staining 

Flow cytometric analysis was carried out as follows. Bacteria were grown under culture conditions, 
which resulted in expression of the antigen as shown by the immunoblot analysis. Cells were washed 
twice in Hanks Balanced Salt Solution (HBSS) and the cell density was adjusted to approximately 1 X 10 6 
CFU in lOO^il HBSS, 0.5% BSA. After incubation for 30 to 60 min at 4°C with mouse antisera diluted 50 to 
100-fold, unbound antibodies were washed away by centrifugation in excess HBSS, 0.5% BSA. Secondary 
goat anti-mouse antibody (F(ab'> fragment specific) labeled with fluorescein (FITC) was incubated with 
the cells at 4°C for 30 to 60 min. After washing, cells were fixed with 2% paraformaldehyde. Bound 
antibodies were detected using a Becton Dickinson FACScan flow cytometer and data further analyzed 
with the computer program CELLQuest. Negative control sera included mouse pre-immune serum and 
mouse polyclonal serum generated with lysates prepared from IPTG induced E. coli cells transformed 
with plasmids encoding the genes lamB or fhuA without S. agalactiae genomic insert. 

Bactericidal (killing) assay 

Murine macrophage cells (RAW246.7 or P388.D1) and bacteria were incubated and the loss of viable 
bacteria after 60 min was determined by colony counting. In brief, bacteria were washed twice in Hanks 
Balanced Salt Solution (HBSS) and the cell density was adjusted to approximately IX 10 5 CFU in 50(^1 
HBSS. Bacteria were incubated with mouse sera (up to 25%) and guinea pig complement (up to 5%) in a 
total volume of 100|al for 60min at 4°C. Pre-opsonized bacteria were mixed with macrophages (murine 
cell line RAW264.7 or P388.D1; 2X 10 6 cells per W0[xl) at a 1:20 ratio and were incubated at 37°C on a 
rotating shaker at 500 rpm. An aliquot of each sample was diluted in sterile water and incubated for 5 
min at room temperature to lyse macrophages. Serial dilutions were then plated onto Todd-Hewitt Broth 
agar plates. The plates were incubated overnight at 37°C, and the colonies were counted with the 
Countermat flash colony counter (IUL Instruments). Control sera included mouse pre-immune serum 
and mouse polyclonal serum generated with lysates prepared from IPTG induced E. coli transformed 
with plasmids harboring the genes lamB oifttuA without S. agalactiae genomic insert. 

Results 

Immunogenicity in mice. The presence of specific antibodies was determined by peptide ELISA and/or 
immunoblotting using the E. coli clone expressing the given epitope embedded in LamB or FhuA 
platform proteins, as it is exemplified in Fig. 5A and B, respectively, and summarized in Table 4 and 
Table 7. 43 novel GBS antigens represented by 61 different epitope regions were shown to be 
immunogenic in mice. Positive sera were then analysed by immunblotting using total bacterial lysates 
and culture supernatants prepared from S. agalactiae serotype III strain (data not shown). This analysis 
served as a first step to determine whether the antigenic proteins were expressed, and if, under which 
growth conditions, in order to evaluate surface expression of the polypeptide by FACS analysis. It was 
anticipated based on literature data that not all proteins would be expressed under in vitro conditions. 

Cell surface staining of S. agalactiae. Cell surface accessibility for several antigenic proteins was 
subsequently demonstrated by an assay based on flow cytometry. GBS cells were incubated with 
preimmune and polyclonal mouse sera raised against S. agalactiae lysate or E. coli clones harboring 
plasmids encoding the platform protein fused to a S. agalactiae peptide, followed by detection with 
fluorescently tagged secondary antibody. As shown in Fig. 6A, antisera raised against S. agalactiae lysate 
contains antibodies against surface components, demonstrated by a significant shift in fluorescence of the 
S. agalactiae serotype III cell population. Similar cell surface staining of S. agalactiae serotype III cells was 
observed with polyclonal sera raised against peptides of many of the GBS antigens identified (Fig. 6B ). In 
some instancies, a subpopulation of the bacteria was not stained, as indicated by the detection of two 
peaks in the histograms (Fig. 6B). This phenomenon may be a result of differential expression of the gene 
products during the growth of the bacterium, insufficient antibody levels or partial inhibition of antibody 
binding caused by other surface molecules or plasma proteins. Importantly, a well-known protective GBS 
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antigen, Sip/gbs0031 is proved to be also positive in this assay. 

In vitro bactericidal activity. Opsonophagocytic killing is the cornerstone of host defense against 
extracellular bacteria, such as S. agalactiae. Cell surface binding of antibodies to bacterial antigens are 
opsonizing and induce killing (bactericidal) by phagocytic cells (macrophages and neutrophil 
granulocytes) if the antibodies induced by the particular antigens can bind activated complement 
components (C3bi). In Figure 7 data are presented on bactericidal activity measured by antigen-specific 
antibodies generated in mice with corresponding epitopes. According to these data, several of the novel 
GBS antigens, for example gbs0012, gbs0016, gbs0428, gbsl306 and gbs2018 induce functional antibodies. 
Importantly, a well-known protective GBS antigen, Sip/gbs0031 is proved to be strongly positive in the 
very same assay. 

These experiments confirmed the bioinformatic prediction that many of the proteins are exported due to 
their signal peptide sequence and in addition showed that they are present on the cell surface of S. 
agalactiae serotype III. They also confirm that these proteins are available for recognition by human 
antibodies with functional properties and make them valuable candidates for the development of a 
vaccine against GBS diseases. 
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Table 1A: Immunogenic proteins identified by bacterial surface display. 



S. agalactiae 
antigenic 
protein 


Putative function 
(by homology) 


predicted immunogenic aa** 


No. of 
selected 
clones per 
ORF and 
screen 


Location of 
identified 
immunogeni 
c region (aa) 


Seq. 
ID 
(DNA, 
Frot.) 


gbs0012 


weakly similar to 
beta-lac tamase 


4-20,35-44,65-70,73-87,92-98,112-137,152-161,177- 
186,193-200,206-213,229-255,282-294,308-313,320- 
326,349-355,373-384,388-406,420-425 


B:13, C:6 


115-199 


1, 218 


gbs0016 


glucan-binding 
protein B 


5-24,35-41,44-70,73-89,103-109,127-143,155-161,185- 

190,192-207,212-219,246-262,304-336,372-382,384- 

393,398-407,412-418,438-444 


B:12, C:4, 
D:3, E:5, H:4, 
1:12, M:2 


1-75, 76-161, 
164-239 


2,219 


gbs0024 


phosphori bosy Iformyl 

glycinamidine 

synthase 


4-10,16-58,60-71,77-92,100-126,132-146,149-164,166- 

172,190-209,214-220,223-229,241-256,297-312,314- 

319,337-343,351-359,378-387,398-418,421-428,430- 

437,440-448,462-471,510-519,525-536,552-559,561- 

568,573-582,596-602,608-630,637-649,651-665,681- 

702,714-732,739-745,757-778,790-805,807-815,821- 

829,836-842,846-873,880-903,908-914,916-923,931- 

940,943-948,956-970,975-986,996-1015,1031-1040,1051- 

1069,1072-1095,1114-1119,1130-1148,1150-1157,1169- 

1176,1229-1238 


F:4 


802-812 


3,220 


gbsOOSl 


surface immunogenic 
protein 


5-12,14-26,35-47,52-67,72-78,83-98,121-141,152-159,163- 

183,186-207,209-257,264-277,282-299,301-309,312- 

318,324-339,358-368,372-378,387-397,425-431 


A:17, B:53, 
C:36, D:4 


46-291 


4, 221 


gbs0048 


Hypothetical protein 


29-38,44-64,70-76,78-87,94-100,102-112,119-134,140- 
149,163-173,178-186,188-194,207-234,247-262,269-290 


K:13 


73-92 


5, 222 


gbs0053 


aldehyde-alcohol 

dehydrogenase 

(adhE) 


10-28,36-63,77-87,103-119,127-136,141-169,171-183,195- 

200,207-232,236-246,251-265,268-283,287-297,314- 

322,335-343,354-363,384-390,405-411,419-436,443- 

455,467-473,480-513,518-529,550-557,565-585,602- 

608,616-625,632-660,665-677,685-701,726-736,738- 

747,752-761,785-796,801-813,838-853,866-871 


E:4 


757-774: 


6, 223 


gbs0061 


rplB ribosomal 
protein L2 


31-38,61-66,74-81,90-115,123-145,154-167,169-179,182- 
193,200-206,238-244,267-272 


F:2, 1:12 


235-251 


7,224 


gbs0084 


DNA-directed RNA 
polymerase, alpha 
subunit (rpoA) 


19-25,38-54,56-64,66-72,74-92,94-100,116-129,143- 
149,156-183,204-232,253-266,269-275,294-307 


C:4, D:6 


241-313 


8, 225 


gbs0107 


conserved 
hypothetical protein 


5-34,50-56,60-65,74-85,89-97,108-119,159-165,181- 
199,209-225,230-240,245-251,257-262,274-282,300-305 


K:2 


64-75 


9, 226 
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S. agalactiae 
antigenic 
protein 


Putative function 
(by homology) 


predicted immunogenic aa* * 


No. of 
selected 
clones per 
ORF and 

screen 


Location of 
identified 
immunogeni 
c region (aa) 


Seq. 

ID 
(DNA, 
Prot.) 


gbs0108 


deoxyuridine 5"~ 

triphosphate 

nucleotidohydrolase 


5-13,16-21,27-42,45-52,58-66,74-87,108-114,119-131 


1:5 


39-51 


10, 227 


gbs0113 


ribose ABC 
transporter 


6-23,46-54,59-65,78-84,100-120,128-133,140-146,159- 

165,171-183,190-204,224-232,240-248,250-259,274- 

280,288-296,306-315 


F:4 


267-274 


11, 228 


gbs0123 


similar to 

argininosuccinate 

synthase 


5-12,15-24,26-36,42-65,68-80,82-104,111-116,125- 

144,159-167,184-189,209-218,235-243,254-265,269- 

283,287-300,306-316,318-336,338-352,374-392 


K:17 


162-174 


12, 229 


gbs0127 


rpmV SOS ribosomal 
protein L28 


30-42,45-54 


F:ll 


25-37 


13, 230 


gbs0144 


oligopeptide ABC 

transporter, 

substrate-binding 


10-30,53-59,86-95,116-130,132-147,169-189,195-201,212- 
221,247-256,258-265,278-283,291-298,310-316,329- 
339,341-352,360-367,388-396,398-411,416-432,443- 
452,460-466,506-512,515-521,542-548 


E:7 


419-431 


14, 231 


gbs0183 


membrane protein, 
putative 


4-27,30-53,60-67,70-90,92-151,159-185,189-195,198- 
210,215-239 


F:9 


173-189 


15, 232 


gbs0184 


oligopeptide ABC 

transporter, 

oligopeptide-binding 


4-26,41-54,71-78,116-127,140-149,151-158,161-175,190- 
196,201-208,220-226,240-252,266-281,298-305,308- 
318,321-329,344-353,372-378,384-405,418-426,429- 
442,457-463,494-505,514-522 


E:6 


174-188 


16, 233 


gbs0235 


glycine 

betaine/carnitine/chol 
ine ABC transporter 


17-25,27-39,61-67,81-89,99-110,120-131,133-139,147- 

161,167-172,179-185,192-198,203-213,226-238,243- 

258,261-267,284-290,296-307,311-328,340-352,356-371 


G:8, H:15 


239-256 


17, 234 


gbs0255 


conserved 

hypothetical protein 


8-30,40-49,67-80,114-123,126-142,152-162,188-194 


E:2 


57-70 


18, 235 


gbs0260 


glycyl-tRNA 
synthetase (beta 
subunit) 


4-23,28-34,36-47,50-61,76-81,89-94,96-104,112-119,126- 

146,155-181,195-200,208-214,220-229,244-260,263- 

276,282-288,292-300,317-323,336-351,353-359,363- 

375,382-399,415-432,444-455,458-471,476-481,484- 

492,499-517,522-529,535-541,543-568,572-584,586- 

600,607-617,626-637,656-675 


F:3 


282-297 


19, 236 


gbs0268 


transketolase (tkt) 


6-24,30-35,38-45,63-91,134-140,146-160,167-188,214- 
220,226-234,244-250,260-270,286-301,316-329,340- 
371,429-446,448-459,474-481,485-491,512-526,537- 
544,550-565,573-583,596-613,621-630,652-658 


E:7 


87-97 


20, 237 


gbs0286 


NH3-dependent 


8-20,26-48,56-67,76-86,94-109,115-121,123-129,143- 


F:12,G:7,H:8 


237-247 


21, 238 
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S. agalactiae 
antigenic 
protein 
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NAD+ synthetase 


160,178-186,191-198,201-208,221-236,238-244,260-268 








gbs0288 


similar to penicillin- 
binding protein 1A 


4-40,42-57,73-87,98-117,126-135,150-156,166-174,196- 
217,231-236,248-258,276-284,293-301,307-313,339- 
347,359-365,375-387,395-402,428-440,445-456,485- ' 
490,497-505,535-541,547-555,610-625,648-656,665-671 


D:5, K:3 


448-528 


22, 239 


gbs0343 


seryl-tRNA 
synthetase (serS) 


10-18,39-45,51-61,80-96,98-106,110-115,158-172,174- 

183,191-200,220-237,249-255,274-289,308-324,331- 

341,372-381,384-397,405-414 


1:3 


322-338 


23, 240 


gbs0411 


Hypothetical protein 


30-36,38-56,85-108,134-147,149-160,163-183,188- 
201,206-211,219-238,247-254 


1:11 


5-13 


24, 241 


gbs0428 


similar to fibrinogen 
binding protein, 
putative 

peptidoglycan linked 
protein (LPXTG 
motif) 


11-40,98-103,110-115,133-145,151-159,172-179,192- 
201,204-212,222-228,235-245,258-268,283-296,298- 
309,322-329,342-351,354-362,372-378,385-393,407- 
418,495-516 


A:7, B:2, C:31 


1-148 


25, 242 


gbs0437 


glucose-6-phosphate 
isomerase (pgi) 


5-19,21-36,73-94,112-119,122-137,139-145,152-167,184- 
190,198-204,208-224,249-265,267-281,299-304,309- 
317,326-333,356-364,368-374,381-389,391-414,419- 
425,430-435 


1:26 


113-140 


26, 243 


gbs0460 


decarboxylase 


45-54,59-67,78-91 


1:7, K:ll 


15-23 


27,244 


gbs0465 


oxydoreductase 


11-22,33-47,52-80,88-112,124-129 


F:4 


6-25 


28, 245 


gbs0470 


similar to alpha 
protein, putative 
peptidoglycan linked 
protein (LPXTG 
motif) 


26-41,51-63,80-89,93-115,150-163,187-193,220-237,240- 

249,286-294,296-306,316-329,345-353,361-370,407- 

425,428-437,474-482,484-494,504-517,533-541,549- 

558,595-613,616-625,660-668,673-685,711-726,736- 

744,749-761,787-802,812-820,825-837,863-878,888- 

896,901-913,939-954,964-972,977-989,1003-1008,1016- 

1022,1028-1034,1041-1053,1059-1074,1101-1122 


B:4, C:2, D:8 


420-511, 581- 
704 


29, 246 


gbs0489 


acetyltransferase, 
GNAT family 


18-25,27-55,71-83,89-95,102-113,120-146,150-156,174- 
185 


E:32 


159-175 


30, 247 


gbs0492 


gbs0492 valyl-tRNA 
synthetase 


24-30,38-56,63-68,87-93,136-142,153-164,183-199,213- 

219,226-234,244-261,269-278,283-289,291-297,320- 

328,330-336,340-346,348-356,358-366,382-387,401- 

408,414-419,449-455,468-491,504-512,531-537,554- 

560,597-608,621-627,632-643,650-662,667-692,703- 

716,724-737,743-758,783-794,800-818,846-856 


A:3 


806-884 


31, 248 
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gbs0538 


amino acid ABC 
transporter (ATP- 
Dinding protein) 


4-14,21-39,86-92,99-107,121-131,136-144,147-154,158- 
166,176-185,193-199,207-222,224-230 


G:l 


117-136 


32, 249 


gbs0539 


similar to 

phosphomannomuta 

se 


65-76,85-97,103-109,115-121,125-146,163-169,196- 
205,212-219,228-237,241-247,254-262,269-288,294- 
303,305-313,328-367,395-401,405-412,418-429,437- 
447,481-488,506-513,519-524,530-541,546-557 j 


K:4 


266-284 


33, 250 


gbs0555 


beta-lactam 
resistance factor 
(fibA) 


5-14,37-42,49-71,78-92,97-112,127-136,147-154,156- 

163,186-198,216-225,233-243,248-253,295-307,323- 

332,359-366,368-374,380-398 


E:3 


194-223 


34, 251 


gbs0579 


dipeptidase 


4-11,33-39,45-72,100-113,119-129,136-144,169-175,177- 

185,200-208,210-219,262-276,278-297,320-326,336- 

344,347-362,381-394,443-453 


1:4 


438-454 


35, 252 


gbs0580 


zinc ABC transporter, 
zinc-binding 
adhesion, lipoprotein 


4-29,31-52,55-61,95-110,138-158,162-171,179-187,202- 

229,239-248,251-256,262-267,269-285,304-310,351- 

360,362-368,381-388,415-428,435-440,448-458 


1:11 


161-178 


36, 253 


gbs0628 


cell wall surface 
anchor family protein 
(IPxTG) 


4-17,19-28,32-43,47-59,89-110,112-126,128-134,140- 
148,152-161,169-184,191-204,230-235,255-264,328- 
338,341-347,401-409,413-419,433-441,449-458,463- 
468,476-482,486-492,500-506,529-545 


1:9, H:l 


305-381 


37, 254 


gbs0632 


cell wall surface 
anchor family 
protein, putative 
(FPKTG motive) 


10-29,38-45,53-61,134-145,152-160,163-170,202-208,219- 

229,248-258,266-275,282-288,315-320,328-334,377- 

385,392-402,418-424,447-453,460-471,479-487,491- 

497,500-507,531-537,581-594,615-623,629-635,644- 

652,659-666,668-678,710-717,719-728,736-741,747- 

760,766-773,784-789,794-800,805-817,855-861,866-887 


H:3 


698-715 


38, 255 


gbs0634 


putative surface 
protein 


16-26,29-37,44-58,62-68,74-80,88-95,97-120,125-144,165- 
196 


H:l 


58-72 


39, 256 


gbs0667 


regulatory protein, 
putative, truncation 


14-21,23-46,49-60,63-74,78-92,96-103,117-129,134- 
161,169-211,217-231,239-248,252-281,292-299,313-343 


1:2 


243-257 


40, 257 


gbs0672 


transcriptional 
regulator (GntR 
family) 


11-27,46-52,67-72,76-84,91-112,116-153,160-175,187- 
196,202-211,213-220 


G:ll 


43-76 


41, 258 


gbs0687 


Hypothetical protein 


5-29,37-56,78-86,108-118,152-161 


r:4 


120-130 


42, 259 


gbs0785 


Similar to penicillin 
binding protein 2B 


8-14,19-41,52-66,75-82,87-92,106-121,127-133,136- 
143,158-175,180-187,196-204,221-228,239-245,259- 
265,291-306,318-323,328-340,352-358,361-368,375- 


K:3 


184-196 


43, 260 
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t 


i81,391-399,411-418,431-442,446-455,484-496,498- 
510,527-533,541-549,558-565,575-585,587-594,644- 
355,661-668,671-677 








gbs0811 


^eta-glucosidase 


1-22,29-38,55-62,75-81,102-107,110-134,143-150,161- 

167,172-179,191-215,223-233,241-247,251-264,266- 

272,288-309,340-352,354-366,394-402,414-438 


L:13 


L98-218 


44, 261 


gbs0828 


hypothetical protein 


24-44,49-70,80-91,105-118,128-136,140-154 


1:3 


77-92 


45, 262 


gbs0851 


hypothetical protein 


5-22,31-36,41-47,67-74,83-90,105-122,135-143,160-167 


E:2 


118-129 


46, 263 


gbs0865 


hypothetical protein 


4-25,33-73,81-93,96-106,114-120,122-128,130-172,179- 
208,210-241,251-283,296-301 


K:4 


92-100 


47, 264 


gbs0890 


exonuclease RexB 
(rexB) 


14-24,29-38,43-50,52-72,86-97,101-107,110-125,127- 

141,145-157,168-175,177-184,186-195,205-226,238- 

250,255-261,284-290,293-304,307-314,316-323,325- 

356,363-371,383-390,405-415,423-432,442-454,466- 

485,502-511,519-527,535-556,558-565,569-574,612- 

634,641-655,672-686,698-709,715-722,724-732,743- 

753,760-769,783-792,818-825,830-839,842-849,884- 

896,905-918,926-940,957-969,979-1007,1015-1021,1049- 

1057 


E:5 


336-349 


48, 265 


gbs0896 


similar to acetoin 
dehydrogenase 


6-16,26-31,33-39,62-73,75-85,87-100,113-123,127- 

152,157-164,168-181,191-198,208-214,219-226,233- 

254,259-266,286-329 


K:2 


181-195 


49, 266 


gbs0898 


acetoin 

dehydrogenase, 
thymine PPi 
dependent 


4-13,32-39,53-76,99-108,110-116,124-135,137-146,149- 
157,162-174,182-190,207-231,242-253,255-264,274- 
283,291-323,334-345,351-360,375-388,418-425,456- 
474,486-492,508-517,520-536,547-560,562-577 


E:13, F:2, 1:2, 
f:2 


31-45, 419- 
443 


50, 267 


gbs0904 


phosphoglucomutase 
/phosphomannomutc 
se family protein 


15-26,30-37,42-49,58-90,93-99,128-134,147-154,174- 
1 179,190-197,199-205,221-230,262-274,277-287,300- 
314,327-333,343-351,359-377,388-396,408-413,416- 
425,431-446 


1:3 


246-256 


51, 268 


gbs0918 


weakly similar to 
histidine triad 
protein, putative 
lipoprotein 


5-26,34-42,47-54,61-67,71-104,107-115,131-138,144- 

153,157-189,196-202,204-210,228-245,288-309,316- 

329,332-341,379-386,393-399,404-412,414-421,457- 

468,483-489,500-506,508-517,523-534,543-557,565- 

580,587-605,609-617,619-627,631-636,640-646,662- 

668,675-682,705-710,716-723,727-732,750-758,784- 

789,795-809,869-874 


B:5, C:ll, 
D:36, E:3, K:3 


14-138, 166- 
286, 372-503, 
674-696, 754- 
859 


52, 269 
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gbs0931 


pyruvate kinase 


5-17,32-38,40-47,80-89413-119,125-137440-154,157- 
163,170-177,185-199,213-225,228-236,242-248,277- 
290,292-305,323-333,347-353,364-370,385-394,399- 
406,423-433,441-451,462-474,477-487 


F:78 


116-124 


53, 270 


gbs0947 


similar to L-Lactate 
Dehydrogenase 


7-16,18-30,32-49,53-61,63-85,95-101,105-115,119- 

134,143-150,159-178,185-202,212-229,236-250,254- 

265,268-294 


K:28 


63-72 


54, 271 


gbs0948 


DNA gyrase, A 
subunit (gyrA) 


4-12,19-47,73-81,97-103,153-169,188-198,207-213,217- 

223,236-242,255-265,270-278,298-305,309-317,335- 

347,354-363,373-394,419-424,442-465,486-492,500- 

507,542-549,551-558,560-572,580-589,607-614,617- 

623,647-653,666-676,694-704,706-714,748-754,765- 

772,786-792,795-806 


1:4 


358-370 


55, 272 


gbs0969 


similar to unknown 
plasmid protein 


18-28,30-38,40-46,49-55,69-78,82-98,104-134,147- 

153,180-190,196-202,218-236,244-261,266-273,275- 

286,290-295,301-314,378-387,390-395,427-434 


E:3 


290-305 


56, 273 


gbs0971 


similar to putative 
plasmid replication 
protein 


4-13,20-31,39-51,54-61,69-84,87-105,117-124 


K:17 


108-125 


57, 274 


gbs0972 


Hypothetical protein 


24-34,43-54,56-66,68-79 


E:3 


50-69 


58, 275 


gbs0983 


similar to plasmid 
protein 


5-43,71-77,102-131,141-148,150-156,159-186,191- 

207,209-234,255-268,280-286,293-299,317-323,350- 

357,363-372,391-397,406-418,428-435,455-465,484- 

497,499-505,525-531,575-582,593-607,621-633,638- 

649,655-673,684-698,711-725,736-741,743-752,759- 

769,781-793,813-831,843-853,894-905,908-916,929- 

946,953-963,970-978,1001-1007,1011-1033 


D:ll, E:2, F:2, 
J:10, K:10, 
L:46, M:3 


165-178, 818- 
974 


59, 276 


gbs0986 


surface antigen 
proteins, putative 
peptidoglycan bound 


16-44,63-86,98-108,185-191,222-237,261-274,282- 

294,335-345,349-362,374-384,409-420,424-430,440- 

447,453-460,465-473,475-504,522-534,538-551,554- 

560,567-582,598-607,611-619,627-640,643-653,655- 

661,669-680,684-690,701-707,715-731,744-750,756- 

763,768-804,829-837,845-853,855-879,884-890,910-928 


B:3, C:12, 
D:2, E:3, F:20, 
H:3,I:3,J:5, 
M:2 


77-90, 144- 
212, 279-355, 
434-536, 782- 
810, 875-902 


60, 277 


gbs0988 


similar to plasmid 
surface exclusion 
protein, putative 
peptidoglycan bound 


4-22,29-41,45-51,53-66,70-77,86-95,98-104,106-124,129- 
135,142-151,153-161,169-176,228-251,284-299,331- 
337,339-370,380-387,393-398,406-411,423-433,440- 
452,461-469,488-498,501-516,523-530,532-559,562- 


F:2 


212-227 


61, 278 
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protein (LPXTG 
motif) 


567,570-602,612-628,630-645,649-659,666-672,677- 
696,714-723,727-747 








gbs0991 


ATP-dependent Clp 
protease, ATP- 
binding subunit 
ClpA 


4-9,17-31,35-41,56-61,66-75,81-87,90-124,133-138,149- 
163,173-192,213-219,221-262,265-275,277-282,292- 
298,301-307,333-346,353-363,371-378,419-430,435- 
448,456-469,551-570,583-599,603-612 


F:15 


275-291 


62, 279 


gbs0993 


similar to plasmid 
proteins 


28-34,53-58,72-81,100-128,145-154,159-168,172-189,217- 

225,227-249,256-263,299-309,322-330,361-379,381- 

388,392-401,404-417,425-436,440-446,451-464,469- 

487,502-511,543-551,559-564,595-601,606-612,615- 

626,633-642,644-650,664-670,674-684,692-701,715- 

723,726-734,749-756,763-771,781-787,810-843,860- 

869,882-889,907-917,931-936,941-948,951-958,964- 

971,976-993,1039-1049,1051-1065,1092-1121,1126- 

1132,1145-1151,1158-1173,1181-1192,1194-1208,1218- 

1223,1229-1243,1249-1254,1265-1279,1287-1297,1303- 

1320,1334-1341,1343-1358,1372-1382,1406-1417,1419- 

1425,1428-1434,1441-1448,1460-1473,1494-1504,1509- 

1514,1529-1550 


B:2, F:2, J:4, 
K:2, M:7 


654-669, 
1400-1483 


63, 280 


gbs0995 


hypothetical protein 


10-16,20-25,58-65,97-109,118-132,134-146,148-155,186- 

195,226-233,244-262,275-284,295-310,317-322,330- 

339,345-351,366-375,392-403,408-415,423-430,435- 

444,446-457,467-479,486-499,503-510,525-537,540- 

585,602-612,614-623,625-634,639-645,650-669,700- 

707,717-724,727-739 


H:3, 1:39, J:3, 
M:3 


205-230, 733- 
754 


64, 281 


gbs0997 


hypothetical protein 


5-22,37-43,72-81,105-113,128-133,148-160,188-194,204- 
230,238-245,251-257 


D:2, F:52 


194-213 


65, 282 


gbs0998 


hypothetical protein 


16-21,35-41,56-72,74-92,103-109 


1:2 


62-68 


66, 283 


gbslOOl 


hypothetical protein 


4-15,17-82,90-104,107-159,163-170,188-221,234-245,252- 
265 


G:8 


220-235 


67, 284 


gbsl015 


hypothetical protein 


16-22,36-46,61-75,92-107,113-121,139-145,148-160 


K:17 


30-42 


68, 285 


gbsl035 


conserved 

hypothetical protein 


4-12,20-26,43-49,55-62,66-78,121-127,135-141,146- 
161,164-170,178-189,196-205,233-238,269-279,288- 
318,325-332,381-386,400-407 


E:3 


328-346 


69, 286 


gbsl041 


hypothetical protein 


5-12,31-49,57-63,69-79,89-97,99-114,116-127,134- 

142,147-154,160-173,185-193,199-204,211-222,229- 

236,243-249,256-274 


L:2 


58-68 


70, 287 



WO 2004/099242 PCT/EP2004/004856 

-69- 



S. agalactiae 
antigenic 
protein 


Putative function 
(by homology) 


predicted immunogenic aa** 


No. of 
selected 
clones per 
ORF and 

screen 


Location of 
identified 
immunogeni 
c region (aa) 


Seq. 
ID 
(DNA, 
Prot.) 


gbsl066 


hypothetical protein 


10-20,28-34,39-53,68-79,84-90,99-106 


K:2 


73-79 


71, 288 


gbsl087 


FbsA 


14-37,45-50,61-66,77-82,93-98,109-114,125-130,141- 
146,157-162,173-178,189-194,205-210,221-226,237- 
242,253-258,269-274,285-290,301-306,316-332,349- 
359,371-378,385-406 


A:7, B:2, C:4, 
E:277, G:523, 
f:25 


34-307, 312- 
385 


72, 289 


gbsllOS 


ABC transporter 

(ATP-binding 

protein) 


4-10,17-38,50-85,93-99,109-116,128-185,189-197,199- 
210,223-256,263-287,289-312,327-337,371-386,389- 
394,406-419,424-432,438-450,458-463,475-502,507- 
513,519-526,535-542,550-567 


1:5 


361-376 


73, 290 


gbslll6 


xanthine permease 
(pbuX) 


10-39,42-93,100-144,155-176,178-224,230-244,246- 
255,273-282,292-301,308-325,332-351,356-361,368- 
379,386-393,400-421 


1:48 


138-155 


74, 291 


gbsll26 


similar to plasmid 
unknown protein 


5-11,17-34,40-45,50-55,72-80,101-123,145-151,164- 

172,182-187,189-195,208-218,220-241,243-252,255- 

270,325-331,365-371,391-398,402-418,422-428,430- 

435,443-452,463-469,476-484,486-494,503-509,529- 

553,560-565,570-590,608-614,619-627,654-661,744- 

750,772-780,784-790,806-816,836-853,876-885,912- 

918,926-933,961-975,980-987,996-1006,1016-1028,1043- 

1053,1057-1062 


E:2,K:3 


994-1003, 
1033-1056 


75, 292 


gbsll43 


putative 

peptidoglycan linked 
protein (LPXTG) 


17-45,64-71,73-81,99-109,186-192,223-238,262-275,283- 

295,336-346,350-363,375-385,410-421,425-431,441- 

448,454-463,468-474,476-512,523-537,539-552,568- 

583,599-608,612-620,628-641,644-654,656-662,670- 

681,685-695,702-708,716-723,725-735,757-764,769- 

798,800-806,808-816,826-840,846-854,856-862,874- 

881,885-902,907-928 


C:3, D:2, F:15, 
r:3 


274-350, 443- 
513 


76, 293 


gbsll45 


surface exclusion 
protein SeclO 


4-22,29-41,45-51,53-61,70-76,85-92,99-104,111-122,134- 

140,142-154,163-174,224-232,255-265,273-279,283- 

297,330-335,337-348,356-367,373-385,391-396,421- 

431,442-455,475-485,493-505,526-538,544-561,587- 

599,605-620,622-651,662-670,675-681,687-692,697- 

712,714-735 


C:2 


252-262 


77, 294 


gbsll58 


Similar to 
oxidoreductase 


4-12,15-35,40-46,50-59,67-94,110-128,143-169,182- 
188,207-215,218-228,238-250 


K:2 


74-90 


78, 295 


gbsll65 


cysteine 

desulphurase (iscS-1) 


9-18,42-58,78-85,88-95,97-106,115-122,128-134,140- 
145,154-181,186-202,204-223,261-267,269-278,284- 


F:5 


12-29 


79, 296 
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293,300-336,358-368 








gbsll95 


staphylokinase and 
streptokinase 


7-34,46-53,62-72,82-88,100-105,111-117,132-137,144- j 

160,166-180,183-189,209-221,231-236,246-253,268- 

282,286-293,323-336,364-372,378-392,422-433 


B:3, C:2, D:2, 
G:3, H:8 


388-405 


80, 297 


gbsl209 


ATP-dependent 
DNA helicase PcrA 


21-27,34-50,72-77,80-95,164-177,192-198,202-220,226- 

236,239-247,270-279,285-292,315-320,327-334,348- 

355,364-371,388-397,453-476,488-497,534-545,556- 

576,582-588,601-607,609-616,642-662,674-681,687- 

697,709-715,721-727,741-755 


C:3, 1:5 


621-739 


81, 298 


gbsl214 


conserved 

hypothetical protein 


4-14,16-77,79-109 


B:2 


25-99 


82, 299 


gbsl242 


CpsG, beta-1,4- 
galactosyltransferase 


4-9,17-23,30-37,44-55,65-72,77-93,102-121,123-132,146- 
153 


L:24 


17-29 


83, 300 


gbsl260 


ABC transporter, 
ATP-binding protein 


4-18,25-41,52-60,83-92,104-112,117-123,149-155,159- 
167,170-192,201-210,220-227,245-250 


1:17 


124-137 


84, 301 


gbsl270 


gbsl270 hyaluronate 
lyase 


8-25,50-55,89-95,138-143,148-153,159-169,173-179,223- 

238,262-268,288-295,297-308,325-335,403-409,411- 

417,432-446,463-475,492-501,524-530,542-548,561- 

574,576-593,604-609,612-622,637-654,665-672,678- 

685,720-725,731-739,762-767,777-783,820-838,851- 

865,901-908,913-920,958-970,1000-1006,1009-1015,1020- 

1026,1043-1052,1055-1061 


C:19, D:5, 
L:19 


1-128, 252- 
341, 771-793, 
1043-1058 


85, 302 


gbsl305 


hypothetical protein 


16-26,33-46 


1:2 


64-76 


86, 303 


gbsl306 


Laminin binding 
protein 


4-27,69-77,79-101,117-123,126-142,155-161,171-186,200- 

206,213-231,233-244,267-273,313-329,335-344,347- 

370,374-379,399-408,422-443,445-453,461-468,476- 

482,518-534,544-553,556-567,578-595,601-620,626- 

636,646-658,666-681,715-721,762-768,778-785,789- 

803,809-819 


A:6, B:7, 
C:17, D:72, 
E:8, F:91, G:2, 
H:4,I:26,J:3, 
K:14 


22-108, 153- 
318, 391-527, 
638-757 


87, 304 


gbsl307 


Lmb, laminin- 
binding surface 
protein 


6-21,32-43,62-92,104-123,135-141,145-152,199-216,218- 
226,237-247,260-269,274-283,297-303 


A:2, D:3 


1-72, 127-211 


88, 305 


gbsl308 


C5a peptidase, 
authentic frameshift 


6-26,50-56,83-89,108-114,123-131,172-181,194-200,221- 

238,241-247,251-259,263-271,284-292,304-319,321- 

335,353-358,384-391,408-417,424-430,442-448,459- 

466,487-500,514-528,541-556,572-578,595-601,605- 

613,620-631,635-648,660-670,673-679,686-693,702- 


B:4, C:15, 
D:70, E:18, 
F:26, G:5, 
H:4,J:2,K:40 


1-213, 269- 
592, 992-1120 


89, 306 
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708,716-725,730-735,749-755,770-777,805-811,831- 
837,843-851,854-860,863-869,895-901,904-914,922- 
929,933-938,947-952,956-963,1000-1005,1008-1014,1021- 
1030,1097-1103,1120-1130,1132-1140 








gbsl309 


hypothetical protein 


9-16,33-39,47-59,65-79,81-95,103-108,115-123,138- 
148,163-171,176-185,191-196,205-211,213-221,224- 
256,261-276,294-302,357-363,384-390 


E:2, F:4, H:2, 
J:2 


95-111, 161- 
189 


90, 307 


gbsl311 


transposase, C- 
terminal part 


^1-27,35-45,70-76,92-105,129-143,145-155,161-166,170- 
191,204-211,214-231,234-246,249-255,259-275 


F:3 


1-18 


91, 308 


gbsl321 


hypothetical protein 


21-35,45-53,56-64,69-97 


P:7 


1-16 


92, 309 


gbsl352 


putative helicase and 
methylase 


25-33,41-47,61-68,86-101,106-114,116-129,134-142,144- 

156,163-176,181-190,228-251,255-261,276-292,295- 

305,334-357,368-380,395-410,424-429,454-460,469- 

482,510-516,518-527,531-546,558-570,579-606,628- 

636,638-645,651-656,668-674,691-698,717-734,742- 

754,765-770,792-797,827-835,847-859,874-881,903- 

909,926-933,942-961,964-977,989-1004,1010-1028,1031- 

1047,1057-1075,1081-1095,1108-1117,1138-1144,1182- 

1189,1193-1206,1220-1229,1239-1246,1257-1267,1271- 

1279,1284-1301,1312-1320,1329-1335,1341-1347,1358- 

1371,1399-1404,1417-1426,1458-1463,1468-1476,1478- 

1485,1493-1506,1535-1541,1559-1574,1583-1590,1595- 

1601,1603-1611,1622-1628,1634-1644,1671-1685,1689- 

1696,1715-1720,1734-1746,1766-1775,1801-1806,1838- 

1844,1858-1871,1910-1917,1948-1955,1960-1974,2000- 

2015,2019-2036,2041-2063 


E:3, H:2, M:4 


748-847, 
1381-1391 


93, 310 


gbsl356 


Putative 

peptidoglycan linked 
protein (LPXTG 
motif) - Agglutinin 
receptor 


5-12,18-24,27-53,56-63,96-113,119-124,131-136,157- 

163,203-209,215-223,233-246,264-273,311-316,380- 

389,393-399,425-433,445-450,457-462,464-470,475- 

482,507-513,527-535,542-548,550-565,591-602,607- 

613,627-642,644-664,673-712,714-732,739-764,769- 

782,812-818,826-838,848-854,860-871,892-906,930- 

938,940-954,957-973,990-998,1002-1021,1024-1033,1037- 

1042,1050-1060,1077-1083,1085-1092,1100-1129,1144- 

1161,1169-1175,1178-1189,1192-1198,1201-1207,1211- 

1221,1229-1239,1250-1270,1278-1292,1294-1300,1314- 

1335,1344-1352,1360-1374,1394-1405,1407-1414,1416- 

1424,1432-1452,1456-1462,1474-1497,1500-1510,1516- 


C:5, D:62, 1:22 


187-273, 306- 
441 


94, 311 
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1522,1534-1542,1550-1559,1584-1603,1608-1627 








gbsl376 


similar to ATP- 
dependent Clp 
proteinase (ATP- 
binding subunit), 
ClpL 


70-80,90-97,118-125,128-140,142-148,154-162,189- 
202,214-222,224-232,254-260,275-313,317-332,355- 
360,392-398,425-432,448-456,464-470,476-482,491- ? 
505,521-528,533-546,560-567,592-597,605-614,618- 
626,637-644,646-653,660-666,677-691 


K:4 


207-227 


95, 312 


gbsl377 


similar to 
homocysteine S- 
methyltransferase 


5-19,26-34,37-55,57-66,69-83,86-102,115-134,138- 
143,154-172,178-195,209-246,251-257,290-302,306-311 


M:2 


256-266 


96, 313 


gbsl386 


hydroxy-3- 
methylglutaryl- 
coenzyme A synthase 


10-20,22-28,35-57,72-79,87-103,108-128,130-144,158- 
171,190-198,225-242,274-291,301-315,317-324,374-385 


G:2 


353-365 


97, 314 


gbsl390 


hypothetical protein 


4-9,17-30,34-54,59-66,73-94,118-130,135-150,158- 
171,189-198,219-239,269-275,283-301 


E:3, K:4 


89-106, 176- 
193 


98, 315 


gbsl391 


hypothetical protein 


14-20,22-74,77-86,89-99,104-109,126-135,154-165,181- 
195,197-212,216-224,264-275 


E:3 


107-118 


99, 316 


gbsl403 


similar to 5 - 
nucleotidase, 
putative 

peptidoglycan bound 
protein (LPXTN) 


4-18,21-38,63-72,101-109,156-162,165-179,183-192,195- 

210,212-218,230-239,241-256,278-290,299-311,313- 

322,332-341,348-366,386-401,420-426,435-450,455- 

460,468-479,491-498,510-518,532-538,545-552,557- 

563,567-573,586-595,599-609,620-626,628-636,652- 

657,665-681 


A:3, C:12, 
D:4,J:2 


1-198 


100, 317 


gbsl408 


Similar to ABC 
transporter (ATP- 
binding protein) 


4-10,16-38,51-68,73-79,94-115,120-125,132-178,201- 
208,216-223,238-266,269-295,297-304,337-342,347- 
356,374-401,403-422,440-447,478-504,510-516,519- 
530,537-544 


D:2,K:4 


191-206 


101, 318 


gbsl420 


similar to cell wall 
proteins, putative 
peptidoglycan linked 
protein (LPXTG 
motif) 


12-40,42-48,66-71,77-86,95-102,113-120,129-137,141- 
148,155-174,208-214,218-225,234-240,256-267,275- 
283,300-306,313-321,343-350,359-367,370-383,398- 
405,432-439,443-461,492-508,516-526,528-535 


C:3, D:4 


370-478 


102, 319 


gbsl429 


hypothetical protein 


6-14,20-37,56-62,90-95,97-113,118-125,140-145,161- 

170,183-202,237-244,275-284,286-305,309-316,333- 

359,373-401,405-412 


B:2, C:2 


176-187 


103, 320 


gbsl442 


hypothetical 
thiamine biosynthesis 
protein, Thil 


33-44,50-55,59-80,86-101,129-139,147-153,157-163,171- 

176,189-201,203-224,239-245,257-262,281-287,290- 

297,304-320,322-331,334-350,372-390,396-401 


L:28 


71-88, 353- 
372 


104, 321 
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gbsl452 


rplT SOS ribosomal 
protein L20 


5-11,15-24,26-33,40-47,75-88,95-103,105-112 


E:2 


17-30 


105, 322 


gbsl464 


ferrichrome ABC 

transporter 

(permease) 


5-11,16-39,46-54,62-82,100-107,111-124,126-150,154- 
165,167-183,204-238,245-295,301-313,316-335 


F:4 


8-16 


106, 323 


gbsl470 


conserved 

hypothetical protein 


4-19,34-48,69-74,79-107,115-127,129-135,143-153,160- 
169,171-182 


t:4 


142-153 


107, 324 


gbsl528 


conserved 

hypothetical protein 


4-30,65-74,82-106,110-120,124-132,135-140,146-175,179- 

184,190-196,217-223,228-233,250-267,275-292,303- 

315,322-332 


1:7 


174-186 


108, 325 


gbsl529 


Putative 

peptidoglycan bound 
protein (LPXTG 
motif) 


9-16,29-41,47-57,68-84,87-109,113-119,162-180,186- 

193,195-201,203-208,218-230,234-243,265-271,281- 

292,305-312,323-332,341-347,349-363,368-374,383- 

390,396-410,434-440,446-452,455-464,466-473,515- 

522,529-542,565-570,589-600,602-613,618-623,637- 

644,1019-1027,1238-1244,1258-1264,1268-1276,1281- 

1292,1296-1302 


C:2 


883-936 


109, 326 


gbsl531 


UvrB excinuclease 
ABC chain B 


10-17,23-32,39-44,54-72,75-81,88-111,138-154,160- 
167,178-185,201-210,236-252,327-334,336-342,366- 
376,388-400,410-430,472-482,493-526,552-558,586- 
592,598-603,612-621,630-635,641-660 


M:2 


384-393 


110, 327 


gbsl533 


glutamine ABC 
transporter, 
glutamine-binding 
protein 


4-22,24-39,50-59,73-84,100-105,111-117,130-138,155- 

161,173-178,182-189,205-215,266-284,308-313,321- 

328,330-337,346-363,368-374,388-395,397-405,426- 

434,453-459,482-492,501-507,509-515,518-523,527- 

544,559-590,598-612,614-629,646-659,663-684,686- 

694,698-721 


1:4 


445-461 


111, 328 


gbsl536 


hypothetical protein 


14-22,27-33 


E:10 


3-17 


112, 329 


gbsl542 


oxidoreductase, 
aldo/keto reductase 
family 


29-41,66-73,81-87,90-108,140-146,150-159,165-184,186- 
196,216-226,230-238,247-253,261-269 


1:13 


126-140 


113, 330 


gbsl547 


small protein, SmpB 


5-12,16-25,27-33,36-45,60-68,83-88,103-126 


L:ll 


86-101 


114, 331 


gbsl565 


hypothetical protein 


14-23,36-47,56-66,84-89,94-105,111-127,140-153,160- 
174,176-183,189-203,219-225,231-237,250-257 


F:2,J:2,K:8, 
L:21 


194-227 


115, 332 


gbsl586 


peptidyl-prolyl cis- 
trans isomerase, 
cyclophilin-type 


4-25,54-60,64-71,73-82,89-106,117-124,157-169,183- 
188,199-210,221-232,236-244,255-264 


E:3 


58-98 


116, 333 
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gbsl591 


5-methylthioadenosine 
nucleosidase/S- 
adenosylhomocysteine 
nucleosidase 

(pfs) 


13-19,26-36,41-53,55-71,77-84,86-108,114-135,157- 
172,177-183,187-194,208-213,218-226 


1:18, L:2 


110-125, 156- 
170 


117, 334 


gbsl632 


similar to branched- 
chain amino acid 
ABC transporter, 
amino acid-binding 
protein 


5-24,63-69,77-85,94-112,120-137,140-146,152-159,166- 

172,179-187,193-199,206-212,222-228,234-240,244- 

252,257-264,270-289,298-309,316-328,337-348,363-375 


B:2, E:4, 1:3 


1-56, 340-352 


118, 335 


gbsl638 


amino acid permease 


18-39,42-71,78-120,124-144,152-173,179-189,199- 

209,213-222,228-258,269-304,329-361,364-372,374- 

389,396-441 


E:8, G:9, H:9 


313-327 


119, 336 


gbsl662 


conserved 

hypothetical protein 


19-25,91-98,108-120,156-162,168-174,191-204,211- 
216,232-266,272-278,286-308,316-321,327-333,344- 
355,358-364,384-391,395-428,464-476,487-495,497- 
511,544-561,563-573,575-582,588-594 


E:3, H:2 


10-25, 322- 
338 


120, 337 


gbsl666 


SWI/SNF family 
helicase 


14-26,32-49,51-57,59-72,80-91,102-112,119-125,147- 

161,164-173,175-183,188-213,217-222,246-254,260- 

276,282-303,308-318,321-328,333-350,352-359,371- 

378,392-401,407-414,416-443,448-463,471-484,490- 

497,501-514,519-527,539-551,557-570,578-590,592- 

598,600-610,618-629,633-647,654-667,676-689,702- 

709,718-726,728-737,741-760,764-780,786-795,808- 

826,836-842,845-852,865-874,881-887,931-945,949- 

957,968-974,979-986,1003-1009,1023-1029 


F:4 


90-103 


121, 338 


gbsl673 


conserved 

hypothetical protein 


11-16,37-56,60-66,69-77,80-88,93-106,117-139,166-171 


E:2 


72-90 


122, 339 


gbsl695 


dihydroxyacetone 
kinase family protein 


59-84,123-133,145-150,161-167,178-189 


[:8 


115-128 


123, 340 


gbsl754 


excinuclease ABC, A 
subunit (uvrA) 


15-33,39-46,52-64,74-87,108-124,127-144,150-156,173- 

179,184-194,201-208,219-236,243-269,272-295,302- 

309,343-349,356-361,370-379,405-411,414-423,430- 

451,457-464,466-475,477-483,496-502,507-522,541- 

548,557-563,571-577,579-585,590-605,626-642,650- 

662,671-691,704-710,751-769,775-781,786-791,794- 

829,851-858,868-878,884-904,913-919,931-939 


1:2 


132-142 


124, 341 


gbs!760 


Similar to A/G- 
specific adenine 


33-58,64-71,74-80,83-88,96-120,122-139,146-157,167- 
177,207-213,220-225,236-242,264-279,300-305,326- 


K:8 


97-115, 199- 
211 


125, 342 
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glycosylase 


336,340-347,350-360 








gbsl777 


glycerol uptake 
facilitator protein, 
putative 


4-26,43-57,70-99,102-117,121-133,142-148,151-168,170- 
183,192-220,235-249,258-279 


E:4 


30-41 i 


126, 343 


gbsl783 


polyprenyl 
synthetase family 
protein 


34-42,48-58,70-94,110-130,154-160,164-172,178-183,195- 
203,211-222,229-250,256-261,274-284,286-292,312-323 


1:3 


222-233 


127, 344 


gbsl784 


ABC transporter, 
ATP-binding protein 
CydC 


4-9,15-36,38-45,49-74,78-88,100-112,136-191,211- 
220,226-233,239-246,254-274,287-307,316-322,342- 
353,356-366,373-378,384-393,405-431,449-457,459- 
468,487-511,515-524,529-541,544-552,562-568,571-576 


C:2, D:2 


208-280 


128, 345 


gbsl790 


hypothetical protein 


10-27,31-37,39-54,71-108,124-143 


A:23, C:6 


2-107 


129, 346 


gbsl805 


Similar to secreted 
unknown protein 


16-27,38-57,64-70,90-102,104-113,116-137,160-166 


A:197, C:2 


1-80 


130, 347 


gbsl816 


HD domain protein 


13-21,31-36,56-67,127-136,153-171,173-180,184-200,214- 
222,225-231,239-263,267-273 


F:8 


135-159 


131, 348 


gbsl821 


Similar to 23S 
ribosomal RNA 
methyltransferase 


12-27,31-51,68-74,77-87,94-101,108-114,117-123,127- 

134,138-168,173-196,201-207,212-217,227-237,247- 

257,264-280 


K:5 


205-223 


132, 349 


gbsl823 


triad family protein 


17-22,25-54,70-76,92-100 


G:6, H:3 


98-110 


133, 350 


gbsl834 


two-component 
sensor histidine 
kinase 


7-29,40-50,60-67,87-96,105-111,119-164,172-199,206- 

212,220-227,237-259,272-279,282-293,295-309,313- 

319,321-328,345-363,376-386 


E:3, F:6 


159-176 


134, 351 


gbsl842 


transcriptional 
antiterminator, BglG 
family 


4-19,24-30,36-43,50-68,71-89,93-106,141-152,154- 

172,179-197,199-215,229-239,246-252,255-263,281- 

298,319-325,329-356,358-368,374-390,397-409,420- 

429,432-444,450-456,459-475,483-494,496-502,520- 

528,532-556 


1:19 


362-377 


135, 352 


gbsl850 


hypothetical 
transaldolase 


18-25,40-62,77-85,91-97,105-116,123-133,139-184,189- 
197 


G:2 


122-140 


136, 353 


gbsl869 


phosphoglycerate 
kinase 


4-49,52-58,62-70,79-105,109-133,142-150,163-168,206- 

214,220-228,233-240,243-254,274-281,303-311,327- 

338,357-373,378-396,403-413,420-436,441-453,461- 

467,475-481,484-498,506-512,514-521,523-529,562- 

579,589-595,598-603,615-648,714-722,728-742,749- 

758,777-792,795-807 


L:9 


643-658 


137, 354 


gbsl875 


alkyl hydroperoxide 


8-27,37-48,51-56,72-79,87-106,120-138,140-147,167- 


F:3 


456-470 


138, 355 
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S. agalactiae 
antigenic 
protein 


Putative function 
(by homology) 


predicted immunogenic aa** 


No. of 
selected 
clones per i 
ORF and 

screen 


Location of 
identified 
mmunogeni 
c region (aa) 


Seq. 
ID 
(DNA, 
Prot.) 


i 

< 


eductase (large ] 
jubunit) and NADH I 
dehydrogenase 


[.76,187-197,205-216,222-229,234-239,243-249,277- 
>88,292-315,334-343,347-353,363-391,398-404,430- 
147,461-467,478-492,498-507 








gbsl879 


mdopeptidase O 
[pepO) 


5-12,18-24,59-69,80-93,95-109,119-125,130-137,139- 

147,158-163,168-176,182-202,206-215,222-239,241- 

249,267-277,291-298,311-318,321-327,338-344,348- 

355,373-386,393-406,411-417,434-443,446-465,473- 

484,514-521,532-553,584-594 


[:26 


>21-237 


139, 356 


gbsl893 


2-keto-3- 

deoxygluconate 

kinase 


4-14,27-34,50-58,63-72,79-106,109-114,121-142,146- 

154,161-167,169-175,178-201,223-238,249-254,259- 

264,278-292,294-312,319-330 


F:8, K:9 


167-191 


140, 357 


gbsl899 


N-acetylmuramoyl- 
L-alanine amidase, 
family 4 protein 


7-28,36-42,50-61,63-80,122-152,161-174,176-191 


B:2, C:2, E:3 


140-190 


141, 358 


gbsl907 


citrate carrier protein, 
CCS family 


20-57,59-65,70-78,86-102,119-133,142-161,163-173,177- 

188,192-202,204-220,222-236,240-253,279-319,326- 

331,337-383,390-399,406-412,420-427,431-438 


1:2 


381-395 


142, 359 


gbsl924 


similar to 
pneumococcal 
histidine triad 
protein B precursor 
(C-terminal part) 


13-18,28-34,37-43,50-59,75-81,83-97,105-121,139- 

147,200-206,209-227,231-247,260-271,318-327,366- 

381,388-394,399-406 


K:3 


182-201 


143, 360 


gbsl925 


similar to 
pneumococcal 
histidine triad 
protein B precursor 
(N-terminal part) 


6-29,37-43,51-56,70-77,82-102,110-119,127-143,178- 

190,201-209,216-243,261-269,281-292,305-313,327- 

339,341-354,356-373,391-397,423-429,438-445,450-478 


A:2,B:5, 
C:12,D:57 


21-314 


144, 361 


gbsl962 


conserved 
hypothetical protein 


4-12,15-21,32-41,59-76,80-89,96-104 


E:3 


90-103 


145, 362 


gbs2008 


similar to C5A 
peptidase, putative 
peptidoglycan linked 
protein (LPXTG 
motif) 


9-28,30-41,44-54,69-74,77-82,90-97,104-123,125-135,149- 

155,164-173,177-184,217-226,230-235,238-244,258- 

272,282-297,300-305,309-315,317-322,327-336,348- 

362,368-374,380-387,400-411,414-424,451-458,460- 

466,483-494,497-503,506-511,521-528,540-553,569- 

587,598-606,628-642,661-681,688-700,718-733,740- 

749,752-764,769-783,823-834,848-854,862-872,878- 

884,886-898,915-920,938-951,954-961,963-972,982- 


A:253, B:2, 
C:3, D:6, H:2 


3-82, 509-576 


146, 363 
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S. agalactiae 
antigenic 
protein 


Putative function 
(by homology) 


predicted immunogenic aa** 


No. of 
selected 
clones per 
ORF and 

screen 


Location o£ 
identified 
immunogeni 
c region (aa) 


Seq. 
ID 

(DNA, 
Prot.) 






989,996-1003,1010-1016,1021-1032,1038-1044,1047- 

1057,1060-1070,1079-1088,1094-1102,1117-1127,1129- 

1135,1142-1153,1158-1204,1212-1229,1234-1263,1269- 

1277,1308-1313,1327-1338,1344-1376,1400-1415,1436- 

1443,1448-1458,1497-1504,1511-1522,1544-1566 








gbs2018 


putative 

peptidoglycan linked 
protein (LPXTG 
motif) 


8-36,40-64,71-79,88-94,102-109,118-127,138-148,151- 
159,163-174,192-198,200-206,220-233,268-273,290- 
301,304-309,316-323,331-349,378-391,414-420,427- 
437,455-475,494-510,541-547,549-555,616-640 


A:132, B:6, 
C:13, D63, 
E:15,H:2,J:9, 
K:13 


1-60, 55-139, 
212-308, 386- 
458, 458-624 


147, 364 


gbs2029 


hypothetical protein 


16-31,35-42,70-77,91-101,120-130,132-140,143-153,185- 
190,195-202,215-222,228-238,241-251,257-264,268- 
277,288-302,312-324,326-333,341-348,364-382,415- 
429,438-454,458-466,491-499,501-521 


G:8 


273-281 


148, 365 


gbs2042 


hypothetical protein 


8-14,32-57,74-149,155-177,179-212,221-266,271-296,304- 

324,329-346,349-359,368-401,413-419,426-454,465- 

478,493-510 


L:ll 


466-490 


149, 366 


gbs2054 


DNA mismatch 
repair protein HexA 


22-28,33-51,64-89,96-119,126-132,138-146,152-159,161- 

169,172-179,193-198,205-211,221-231,235-254,273- 

280,297-303,312-320,328-346,351-373,378-384,391- 

398,448-454,460-468,470-481,516-558,574-593,597- 

602,613-623,626-646,649-656,668-673,675-683,696- 

708,715-722,724-739,745-751,759-777,780-804,816-822 


E:8 


102-113 


150, 367 


gbs2058 


hypothetical protein 


12-28,41-91,98-107,112-120,125-131,151-193,215- 
221,240-250,263-280 


L:3 


128-138 


151, 368 


gbs2060 


aspartyl-tRNA 
synthetase (aspS) 


16-24,32-38,46-62,68-81,90-105,127-133,144-150,160- 

166,178-184,186-202,210-219,232-240,252-258,264- 

273,293-324,337-344,349-357,360-369,385-398,410- 

416,419-427,441-449,458-476,508-515,523-539,544- 

549,562-569,571-579 


1:3, L:12 


96-109, 127- 
139 


152, 369 


gbs2075 


hypothetical protein 


19-25,28-34,56-61,85-97,110-116 


M:2 


39-53 


153, 370 


gbs2106 


protein of unknown 
function/lipoprotein, 
putative 


4-37,41-50,62-72,91-97,99-109,114-125,136-141,149- 
158,160-166,201-215 


A:5, B:6, C:4, 
D:14, E:ll, 
1:8, K:23 


27-225 


154, 371 


gbs2118 


similar to inosine 

monophosphate 

dehydrogenase 


15-31,44-51,96-105,122-130,149-157,162-168,178- 
183,185-192,198-204,206-213,221-234,239-245,248- 
255,257-266,289-335,349-357,415-422,425-441,448- 
454,462-468 


K:17 


463-481 


155, 372 



WO 2004/099242 



PCT/EP2004/004856 



-78- 



S. agalactiae 


Putative function 


predicted immunogenic aa** 


No. of 


Location of 


Seq. 


antigenic 


(by homology) 




selected 


identified 


ID 


protein 






clones per 
ORF and 

screen 


immunogeni 
c region (aa) 


(DNA, 
Prot.) 


gbs2131 


ABC transporter, 
permease protein, 
putative 


5-31,39-55,63-72,76-99,106-155,160-177,179-199,207- 

217,223-240,245-255,261-267,294-316,321-343,354- 

378,382-452,477-488,529-536,555-569,584-591,593- 

612,620-627,632-640,647-654,671-680,698-704,723- 

730,732-750,769-775,781-788,822-852 


fc2 


505-525 


156, 373 














ARF0112 


Hypothetical protein 


none 


F:6 


3-18 


157, 374 


ARF0147 


Hypothetical protein 


4-14 


E:3, 1:3 


12-24 


158, 375 


ARF0532 


Hypothetical protein 


4-11,22-30 


F:10 


12-25 


159, 376 


ARF0534 


Hypothetical protein 


5-12 


E:2, G:2 


4-18 


160, 377 


ARF0557 


Hypothetical protein 


4-28 


E:2, G:6, H:4 


7-14 


161, 378 


ARF0862 


Hypothetical protein 


6-16 


G:7, H:4 


8-16 


162, 379 


ARF0891' 


Hypothetical protein 


4-15,18-33 


K:6 


24-36 


163, 380 


ARF0895 


Hypothetical protein 


4-10,16-21 


1:21 


20-31 


164, 381 


ARF0943 


Hypothetical protein 


none 


C:2, K:9 


6-19 


165, 382 


ARF0973 


Hypothetical protein 


11-18 


D:2, G:3, H:8, 
1:2, K:2 


3-10 


166, 383 


ARF0999 


Hypothetical protein 


13-24 


B:4, K:3 


3-15 


167, 384 


ARF1010 


Hypothetical protein 


15-27 


K:2 


7-16 


168, 385 


ARF1230 


Hypothetical protein 


11-16 


K:ll 


1-15 


169, 386 


ARF1503 


Hypothetical protein 


4-16 


E:13 


9-21 


170, 387 


ARF1556 


Hypothetical protein 


4-24,40-48,54-67 


F:2 


22-39 


171, 388 


ARF1585 


Hypothetical protein 


6-30,34-55,62-68,78-106 


1:5, J:4 


68-74 


172, 389 


ARF1588 


Hypothetical protein 


none 


1:2 


3-14 


173, 390 


ARF1735 


Hypothetical protein 


9-19 


1:13 


6-21 


174, 391 


ARF1809 


Hypothetical protein 


4-17 


H:2, L:17 


1-9 


175, 392 


ARF1826 


Hypothetical protein 


5-30 


1:6 


1-8 


176, 393 


ARP1882 


Hypothetical protein 


4-16,23-46,51-56 


K:23 


45-55 


177, 394 


ARF1996 


Hypothetical protein 


none 


F:3 


7-16 


178, 395 


CRF0123 


Hypothetical protein 


none 


F:32 


2-14 


179, 396 


CRF0180 


Hypothetical protein 


4-36,43-65 


E:6, G:6, H:12 


50-62 


180, 397 


CRF0208 


Hypothetical protein 


10-30 


1:2 


14-21 


181, 398 


CRF0258 


Hypothetical protein 


9-17 


1:2 


1-10 


182, 399 


CRF0285 


Hypothetical protein 


4-12 


F:2 


3-16 


183, 400 


CRF0311 


Hypothetical protein 


4-15 


H:4 


5-23 


184, 401 


CRF0446 


Hypothetical protein 


none 


L:20 


10-21 


185, 402 
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S. agalactiae 


Putative function 


predicted immunogenic aa** j 


No. of 


Location of 


Seq. 


antigenic 


<by homology) 




selected 


identified 


ID 


protein 






clones per 
QRF and 

screen 


immunogeni 
c region (aa) 


(DNA, 
Prot) 


CRF0455 


Hypothetical protein 


none 


F:5 


6-16 


186, 403 


CRF0491 


Hypothetical protein 


4-29,31-38 


G:4 


2-14 


187, 404 


CRF0520 


Hypothetical protein 


4-35 


H:4 


33-42 


188, 405 


CKF0530 


Hypothetical protein 


none 


G:13, H:8, K:3 


2-17 


189, 406 


CRF0570 


Hypothetical protein 


9-18,30-35 


1:2 


15-33 


190, 407 


CRF0649 


Hypothetical protein 


4-9 


G:8, H:6 


6-12 


191, 408 


CRF0853 


Hypothetical protein 


none 


1:6 


3-17 


192, 409 


CRF0955 


Hypothetical protein 


12-21,37-44,52-61,72-80 


E:7, L:44 


38-48 


193, 410 


CRF0983.1 


Hypothetical protein 


4-10,29-44,54-61,69-78 


K:59 


13-27 


194, 411 


CRF0983.2 


Hypothetical protein 


13-23,36-53 


L:33 


2-15 


195, 412 


CRF1083 


Hypothetical protein 


4-25,28-46,56-72,81-99,120-132,134-142,154-160 


F:18 


129-141 


196, 413 


CRF1095 


Hypothetical protein 


4-15,24-33,35-41,64-86 


L:15 


21-33 


197, 414 


CRF1212.1 


Hypothetical protein 


9-15 


1:5 


4-13 


198, 415 


CRF1212.2 


Hypothetical protein 


4-11,13-19,34-48 


L:30 


15-32 


199, 416 


CRF1290 


Hypothetical protein 


4-21 


1:7 


11-31 


200, 417 


CRF1383 


Hypothetical protein 


23-57 


K:13 


38-50 


201, 418 


CRF1416 


Hypothetical protein 


4-32 


E:16,J:7 


3-13 


202, 419 


CRF1500 


Hypothetical protein 


4-10,13-25,32-42,56-68,72-84 


E:16 


26-38 


203, 420 


CRF1513 


Hypothetical protein 


4-20,31-48,52-58,65-71,80-93,99-108,114-123 


1:2 


37-49 


204, 421 


CRF1518 


Hypothetical protein 


6-12,14-20 


F:28 


3-25 


205, 422 


CRF1663 


Hypothetical protein 


14-25,27-38 


F:10 


5-14 


206, 423 


CRP1667 


Hypothetical protein 


4-41,57-105,109-118,123-136,144-152 


G:4 


86-99 


207, 424 


CRF1832 


Hypothetical protein 


None 


E:5, L:8 


6-19 


208, 425 


CRF1866 


Hypothetical protein 


none 


G:3, H: 18 


2-19 


209, 426 


CRF1892 


Hypothetical protein 


14-47 


L:ll 


1-14 


210, 427 


CRF1942 


Hypothetical protein 


4-21,29-44 


F:14 


2-18 


211, 428 


CRF1992 


Hypothetical protein 


23-29 


K:10 


10-28 


212, 429 


CRF2047 


Hypothetical protein 


6-16,22-36 


K:9 


11-22 


213, 430 


CRF2050 


Hypothetical protein 


4-19,30-44 


1:2 


18-27 


214, 431 


CRP2096 


Hypothetical protein 


5-15,37-45,58-65 


G:2 


38-47 


215, 432 


CRF2113 


Hypothetical protein 


4-15,23-34 


1:5 


4-15 


216, 433 


NRF1311 


transposase, C- 
terminal part 


30-36,44-54,79-85,101-114,138-152,154-164,170-175,179- 
200,213-220,223-240,243-255,258-264,268-284 


F:3 


10-28 


217, 434 
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Table IB: Immunogenic proteins identified by amino acid sequence identity with peptides identified by bacterial 
surface display. 



s. 


Identical 


Peptide sequence 


Peptide name 


Protein 


Immunogeni 


Sequenc 


agalactiae 


region 






identified by 


c region (aa) 


elD 


antigenic 










BSD 






(DNA, 


protein 
















protein) 


(new) 


















gbs0384 


210- 


-226 


VIEYKGNFSQKTINRFKS 


gbs0995.1 


gbsuyyo 


210- 


-226 


AAQ 

4*35, 44y 




738 


-753 




gDsuyyo.z 


gosuyyD 


738 -753 




no no 

gbsu393 


326- 


-344 


\7T^TTf^Vr^1^T Tf-'T/'A 7KTT4 T-TV*\ 7 A 


gDSU!?oO.Z 


crhcflQRA 
gJDaUVOO 


326- 


-344 




326- 


-348 


VKTIGYGKLTGKVNJtiHYVANKL>G 


gDSlx43.X 


gDSll4fc3 


327 


-349 






338- 


-354 


VKTHTTVV A NTRTnr^VT A F V 

V In XTXl 1 \ f\l\X\.LJ\jDv 1-rVT V 




crbc;n986 


338 - 


-354 






371- 


-392 


A A \7MONfT\7TTI?\7T TT^Df^PPTTrFI^ 
V 1\ V^l n IVrivVH xNUOIyI JLriils. 


o-V»q1 1 4*3 9 


p-he1143 


Q79 


- cfyo 






801- 


-809 


T T \/n^k r r~"TT\.TT 
I VJJSJS.Vj11\Li 




oFidOQRfi 


801 - 


-809 






877* 


-901 


V X XJL JL X HxVOiSJ. V DX V^JlVf-111 IVttl /-vJN.VjJ_i 






877 


-901 




gbs0396 


893 


-906 


RQbLLiTTQLSKLQ 


gDsuyoo.i 


gosuyoo 


893 


-906 




gbs0407 


51 


-69 


VRYDKLEALVAYHGAKSAS 


gbs0972.1 


gbs0972 


51 


-69 


436, 452 


gbs0408 


110 


-125 


HQPNRIYLTDKLVPYI 


gbs0971.1 


gbs0971 


110 


-125 


439, 453 


gbs0410 


291 


-305 


QSIKQHDKEKLRTVL 


gbs0969.1 


gbs0969 


291 


-305 


A Af\ AC A 

440, 454 


gbs0714 


210 


-226 


MEYKGNFSQK1TNRFKS 


gbs0995.1 


gbs0995 


210 


-226 


441, 455 




738 


-753 


/— \ r T»/— \n f 1 ot/ta TTrxrt /m /— VT 

QTQIoGKINTOFMRQL 


gbsuyyo.z 


gosuyyo 


738 


-753 




gbs0723 


326 


-344 


VKT1GYGKL 1 GKVNrlrl Y V A 


gDsuyoo.z 


gosuyoo 


326 


-344 


449 4^6 




326 


-348 


VKTIGYGKLTGKVNHHWANKDG 


gbsll43.1 


gbsll43 


327 


-349 






338 


-354 


VNHHYVANKDGSVTAFV 


gbs0986.3 


gbs0986 


338 


-354 






371 


-392 


A A VNQNFVFR VLTKD GRPIF EK 


gbsll43.2 


gbsll43 


372 


-393 






801 


-809 


rVIKKGTNL 


gbs0986.4 


gbs0986 


801 


-809 






877 


-901 


VTHiTEKSKPVEPQKATPKAPAKGL 


gbs0986.5 


gbs0986 


877 


-901 




gbs0726 


893 


-906 


RQELLTPTQLSKLQ 


gbs0983.1 


gbs0983 


893 


-906 


443, 457 


gbs0737 


51 


-69 


VRYDKLEALVAYHGAKSAS 


gbs0972.1 


gbs0972 


51 


-69 


444, 458 


gbs0738 


110 


-125 


HQPNRIYLTDKLVPYI 


gbs0971.1 


gbs0971 


110 


-125 


445, 459 


gbs0740 


291 


-305 


QSIKQIiDKEKLRTVL 


gbs0969.1 


gbs0969 


291 


-305 


446, 460 


gbs0897 


32 


-44 


EGDVLLEIMSDKT 


gbs0898.1 


gbs0898 


32 


-44 


447, 461 


gbs0966 


399 -410 


PGLTVEEKFVTF 


gbs0144.1 


gbs0144 


420 


-431 


448, 462 
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Table 2. Epitope serology with human sera 



Peptides 


positivity 


3a from 


aa to 


oeq iu 


gbs0012.1 


++ 


120 


143 


218 


gbsQ012.2 


+ 


138 


161 


218 


gbs0012.3 


+ 


156 


179 


218 


gbs0016.2 


+++ 


110 


129 


219 


gbs0016.3 


+ 


168 


184 


219 


gbs0048.1 


+ 


74 


90 


222 


gbs0053.1 


+++ 


759 


773 


223 


gbs0061.1 


+++ 


237 


260 


224 


gbs0084.1 


+ 


265 


284 


225 


gbs0107.1 


++ 


65 


74 


226 


gbs0108.1 


++ 


41 


50 


227 


gbs0123.1 


+ 


163 


174 


229 


gbs0127.1 


++ 


26 


37 


230 


gbs0183.1 


+ 


174 


189 


232 


gbs0235.1 


++ 


240 


256 


234 


gbs0260.1 


+ 


285 


297 


236 


gbs0286.1 


+ 


238 


247 


238 


gbs0288.1 


+ 


491 


519 


239 


gbs0437.1 


++ 


114 


140 


243 


gbs0539.1 


+ 


267 


284 


250 


gbs0579.1 


+ 


439 


453 


252 


gbs0580.1 


++ 


162 


178 


253 


gbs0628.1 


++ 


347 


364 


254 


gbs0632.1 


+++ 


699 


715 


255 


gbs0634.1 


+ 


60 


71 


256 


gbs0667.1 


++ 


244 


257 


257 


gbs0672.1 


+ 


44 


63 


258 


gbs0672.2 


+ 


57 


76 


258 


gbs0785.1 


+ 


185 


196 


260 


gbs0851.1 


+ 


119 


129 


263 


gbs0896.1 


++ 


182 


195 


266 


gbs0898.1 


++ 


32 


44 


267 


gbs0898.2 


+ 


424 


442 


267 


gbs0904.1 


+ 


247 


256 


268 


gbs0918.1 




678 


694 


269 


gbs0918.2 


+ 


785 


805 


269 


gbs0918.4 


+ 


55 


77 


269 


gbs0918.5 


+++ 


72 


94 


269 


gbs0995.1 


+ 


210 


226 


281 


gbsl 087.3 


+ 


37 


59 


289 


gbsl 165.1 


+ 


13 


29 


296 


gbs1816.1 


+ 


136 


159 


! 348 


gbsl 821.1 


+ 


205 


222 


| 349 


gbs1823.1 


+ 


99 


110 


350 


gbs1834.1 




160 


176 


351 


gbs1875.1 


+ 


457 


470 


355 


gbsl 879.1 


+ 


221 


237 


356 


gbs1893.1 


+ 


167 


190 


357 


gbs1925.1 


+ 


96 


120 


361 


gbs2018.3 


+++ 


399 


417 


364 


gbs2018.4 


+++ 


503 


519 


364 


gbs2018.5 


+++ 


544 


563 


364 


gbs2106.2 


+ 


46 


68 


371 


gbs2106.7 


+ 


159 


183 


371 


gbs2106.8 


+ 


184 


198 


371 


gbs2118.1 


++ 


463 


481 


372 
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Table 3: Gene distribution in S. agalactiae strains. 



ORP 


Common name 


Gene distribution 
(present of 50) 


Amino acid 
substitutions (in 
serotype IA strain)" 


Seq. 
ID (DNA, 
Prot.) 


gbs0012 


weakly similar to beta-lactamase 


44/44 


n.d. 


1, 218 


gbs0016 


glucan-binding protein B (S.mutans) 


40/44 


0/224 


2, 219 


gbs0024 


Phosphoribosylformylglycinamidine 


46/46 


~10/228 


3,220 


gbs0031 


surface immunogenic protein 


46/46 


1/225 


4,221 


gbs0048 


Unknown 


30/46 


0/61 


5, 222 


gbs0053 


aldehyde-alcohol dehydrogenase (adhE) 


45/45 


0/224 


6, 223 


gbs0061 


rplB ribosomal protein L2 


46/46 


0/218 


7,224 


gbs0084 


DNA-directed RNA polymerase, alpha 
subunit (rpoA) 


45/45 


0/207 


8, 225 


gbs0107 


conserved hypothetical protein 


46/46 


0/235 


9,226 


gbs0108 


deoxyuridine 5 '-triphosphate 
nucleotidohydrolase 


44/44 


0/125 


10, 227 


gbs0113 


ribose ABC transporter 


44/45 


0/227 


11, 228 


gbs0123 


similar to argininosuccinate synthase 


44/44 


0/184 


12, 229 


gbs0127 


rpmV SOS ribosomal protein L28 


46/46 


0/40 


13, 230 


gbs0144 


oligopeptide ABC transporter, substrate- 
binding 


45/45 


0/282 


14, 231 


gbs0183 


membrane protein, putative 


44/44 


0/223 


15, 232 


gbs0184 


oligopeptide ABC transporter, 
oligopeptide-binding 


46/46 


1/203 


16, 233 


gbs0235 


1 ♦ 1 » * l * » * til* A ~r"> / — i 

glycme betaine/carnitme/cholme ABC 
transporter 


46/46 


0/219 


17, 234 


gbs0255 


conserved hypothetical protein 


46/46 


0/180 


18, 235 


gbs0260 


glycyl-tRNA synthetase (beta subunit 


46/46 


0/209 


19, 236 


gbs0268 


transketolase (tkt) 


46/46 


0/208 


20, 237 


gbs0286 


NH3-dependent NAD+ synthetase 


45/45 


0/191 


21, 238 


gbs0288 


similar to penicillin-binding protein 1A 


45/45 


0/212 


22, 239 


gbs0343 


seryl-tRNA synthetase (serS) 


46/46 


0/228 


23, 240 


gbs0428 


similar to fibrinogen binding protein, 
putative peptadoglycan linked protein 
(LPXTG motif) 


45/46 


1/126 


25,242 


gbs0437 


glucose-6-phosphate isomerase (pgi) 


45/45 


0/232 


26, 243 


gbs0460 


decarboxylase 


46/46 


1/81 


27,244 
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ORF 


Common name 


Gene distribution 
(present of 50) 


Amino acid 
substitutions (in 
serotype IA strain)* 


Seq. 
ID (DNA, 
Prot.) 


gbs0465 


oxydoreductase 


46/46 


0/126 


28, 245 


gbs0489 


acetyltransferase, GNAT family 


45/45 


3/144 


30, 247 


gbs0492 


gbs0492 valyl-tRNA synthetase 


44/44 


3/125 


31, 248 


gbs0538 


amino acid ABC transporter (ATP-binding 
protein) 


46/46 


0/214 


32, 249 


gbs0539 


similar to phosphomannomutase 


46/46 


0/244 


33, 250 


gbs0555 


beta-lactam resistance factor (fibA) 


46/46 


0/218 


34, 251 


gbs0579 


dipeptidase 


46/46 


0/218 


35, 252 


gbs0580 


zinc ABC transporter, zinc-binding 
adhesion liprot 


45/45 


2/235 


36, 253 


gbs0628 


cell wall surface anchor family protein - 
(IPxTG) 


42/44 


0/219 


37, 254 


gbs0632 


cell wall surface anchor family protein, 
putative (FPKTG motive) 


44/45 


0/238 


38, 255 


gbs0667 


regulatory protein, putative, truncation 


44/44 


0/229 


40, 257 


gbs0672 


transcriptional regulator (GntR family) 


43/43 


0/203 


41, 258 


gbs0687 


unknown proteins 


45/45 


0/149 


42, 259 


gbs0785 


Similar to penicillin binding protein 2B 


45/45 


0/218 


43, 260 


gbs0828 


unknown proteins 






45, 262 


4o/4b 


1 /ion 


gbs0851 


hypothetical protein 


46/46 


0/140 


46, 263 


gbs0865 


gbs0865 Unknown 


44/44 


0/241 


47, 264 


gbs0890 


exonuclease RexB (rexB) 


46/46 


0/232 


48, 265 


gbs0896 


similar to acetoin dehydrogenase 


46/46 


0/239 


49, 266 


gbs0898 


acetoin dehydrogenase, thymine PPi 
dependent 


45/45 


0/180 


50, 267 


gbs0904 


phosphoglucomutase/phosphomannomuta 
se family prote 


46/46 


0/169 


51, 268 


gbs0918 


weakly similar to histidine triad protein, 
putative lipoprotein 


45/45 


1/209 


52, 269 


gbs0931 


pyruvate kinase 


46/46 


0/185 


53, 270 




ciTnilm* in T -T ssffrn+o T*)fslr\7rl rncrpn a co 
ollllllcil lU l_> l_jcil_lcllc JU/cIiy UlUgcllaoc 


46/46 


0/233 


54, 271 


gbs0948 


DNA gyrase, A subunit (gyrA) 


44/44 


0/172 


55, 272 




conserved hypothetical protein 


46/46 


0^10 


□7, ZOO 


gbsl066 


gbsl066 Unknown 


17/46 


2/92 


71, 288 


gbsl087 


highly repetitive peptidoglycan bound 
protein (LPXTG motif) 


42/45 


n.d. 


72, 289 
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ORF 


Common name 


Gene distribution 
(present of 50) 


Amino acid 
substitutions (in 
serotype IA strain)* 


Seq. 
ID (DNA, 
Prot.) 


gbsll03 


ABC transporter (ATP-binding protein) 


46/46 


1/165 


73, 290 


gbslll6 


xanthine permease (pbuX) 


,,.«.., .... . 

45/45 


1/170 


74,291 


gbsll58 


Similar to oxidoreductase 


44/44 


1/170 


78, 295 


gbsll65 


cysteine desulphurase (iscS-1) 


43/43 


0/148# 


79, 296 


gbsll95 


staphylokinase and streptokinase 


45/45 


60/142 


80, 297 


gbsl209 


ATP-dependent DNA helicase PcrA 


43/44 


l/94# 


81, 298 


gbsl214 


conserved hypothetical protein 


43/46 


0/97 


82, 299 


gbsl260 


ABC transporter, ATP-binding protein 


44/46 


1/198 


84, 301 


gbsl306 


Laminin binding protein (Spellerberg,B et 
al 1999) 


45/46 


0/215 


87, 304 


gbsl307 


lmb laminin-binding surface protein 


45/45 


n.d. 


88, 305 






gbsl308 


C5a peptidase, authentic frameshift 


46/46 


0/205 


89, 306 


gbsl309 


hypothetical protein 


44/46 


0/214 


90,307 


gbsl356 


Putative peptidoglycan linked protein 
(LPXTG motif) - Agglutinin receptor 


20/46 


50/211# 


94, 311 


gbsl376 


similar to ATP-dependent Clp proteinase 
(ATP-binding subunit), ClpL 


45/45 


0/197 


95, 312 


gbsl377 


similar to homocysteine S- 
methyltransferase 


45/45 


0/55 


96,313 


gbsl386 


-hydroxy-3-methylglutaryl-coenzyme A 
synthase 


44/44 


0/219 


97, 314 


gbsl390 


gbsl390 Unknown 


43/43 


0/198 


98, 315 


gbsl391 


gbsl391 Unknown 


44/44 


0/214 


99,316 


gbsl403 


similar to 5-nucleotidase, putative 
peptidoglycan bound protein (LPXTN) 


45/45 


3/189 


100, 317 


gbsl408 


Similar to ABC transporter (ATP-binding 
protein) 


45/45 


0/205 


101, 318 


gbsl429 


unknown proteins 


46/46 


1/193 


103,320 


gbsl452 


rplT SOS ribosomal protein L20 


46/46 


0/101 


105, 322 


gbsl464 


ferrichrome ABC transporter (permease 


44/44 


2/232 


106, 323 


gbsl470 


conserved hypothetical protein 


46/46 


2/164 


107, 324 


gbsl528 


conserved hypothetical protein 


45/45 


0/213 


108,325 


gbsl531 


UvrB excinuclease ABC chain B 


45/45 


0/108 


110, 327 


gbs!533 


glutamine ABC transporter, glutamine- 
binding prote 


44/44 


0/166 


111, 328 
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ORF 


Common name 


Gene distribution 
(present of 50) 


Amino acid 
substitutions (in 
serotype IA strain)* 


Seq. 
ID (DNA, 
Prot.) 


gbsl542 


oxidoreductase, aldo/keto reductase family 


45/45 


1/219 


113, 330 


gbsl565 


hypothetical protein 


43/43 


1/218 


115, 332 


gbsl586 


peptidyl-prolyl cis-trans isomerase, 
cyclophilin-type 


45/45 


1/227 


116, 333 


gbsl591 


5-methylthioadenosine nucleosidase/S- 
adenosylhomoc 


45/45 


0/203 


117, 334 


gbsl632 


similar to branched-chain amino acid ABC 
transporter, amino acid-binding protein 


45/45 


0/223 


118, 335 


gbsl638 


amino acid permease 


45/45 


0/100 


119, 336 


gbsl662 


conserved hypothetical protein 


45/45 


0/213 


120, 337 


gbsl666 


SWI/SNF family helicase 


45/45 


0/200 


121, 338 


gbsl673 


conserved hypothetical protein 


45/45 


0/147 


122, 339 


gbsl695 


dihydroxyacetone kinase family protein 


43/43 


1/165 


123, 340 


gbsl754 


excinuclease ABC, A subunit (uvrA) 


43/43 


0/224 


124, 341 


gbsl760 


Similar to A/G-specific adenine glycosylase 


46/46 


0/181 


125, 342 


gbsl777 


glycerol uptake facilitator protein, putative 


43/43 


0/199 


126, 343 


gbsl783 


polyprenyl synthetase family protein 


45/45 


0/217 


127, 344 


gbsl784 


ABC transporter, ATP-binding protein 
CydC 


45/45 


1/220 


128, 345 


gbsl790 


unknown proteins 


41/43 


3/75# 


129, 346 


gbsl805 


Similar to secreted unknown proteins 


45/45 


0/66 


130, 347 


gbsl816 


HD domain protein 


43/43 


1/176 


131, 348 


gbsl821 


Similar to 23S ribosomal RNA 
methyltransferase 


43/43 


2/155# 


132, 349 


gbsl834 


two-component sensor histidine kinase 


44/44 


0/213 


134, 351 


gbsl842 


transcriptional anti terminator, BglG family 


43/43 


0/208 


135, 352 


gbsl850 


hypothetical transaldolase 


44/44 


0/194 


136, 353 


gbsl875 


alkyl hydroperoxide reductase (large 
subunit) and NADH dehydrogenase 


46/46 


0/192 


138, 355 


gbsl879 


endopeptidase O (pepO) 


43/43 


0/135 


139, 356 


gbsl893 


2-keto-3-deoxygluconate kinase 


36/46 


0/228 


140, 357 


gbsl899 


N-acetylmuramoyl-L-alanine amidase, \ 
family 4 prote 


43/43 


0/149 


141, 358 


gbsl907 


citrate carrier protein, CCS family 


43/43 


0/219 


142, 359 


gbsl925 


similar to pneumococcal histidine triad 
protein B precursor (N-terminal part) j 


43/43 


0/103 


144, 361 


gbsl962 


conserved hypothetical protein 


28/46 


0/136 


145, 362 


gbs2008 


similar to C5A peptidase, putative 


43/43 


n.d. 


146, 363 
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ORF 


Common name 


Gene distribution 
(present of 50) 


Amino acid 
substitutions (in 
serotype IA strain)* 


Seq. ! 
ID (DNA, 
Prot.) 




peptidoglycan linked protein (LPXTG 
motif) 








gbs2018 


putative peptidoglycan linked protein 
(LPXTG motif) 


43/45 


0/104 


147, 364 


gbs2029 


unknown proteins 


44/44 


1/238 


148, 365 


gbs2054 


DNA mismatch repair protein HexA 


46/46 


0/206 


150, 367 


gbs2060 


aspartyl-tRNA synthetase (aspS) 


46/46 


2/211 


152, 369 


gbs2106 


protein of unknown function/lipoprotein, 
putative 


44/44 


0/160 


154, 371 


gbs2118 


similar to inosine monophosphate 
dehydrogenase 


43/43 


0/113 


155, 372 


gbs2131 


ABC transporter, permease protein, 
putative 


45/45 


0/237 


156, 373 
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Table 4. Immunogenicity of epitopes in mice 



ORF s 


aa from < 


aa to 


IB 


Peptide 
ELISA 


Seq ID 


gbs0016 


110 


129 


+ 




219 




168 


184 


+ 






gbs0986 


877 


901 


+ 




277 




333 


354 


+ 








326 


344 


+ 








801 


809 


+ 






gbs1805 


1 


54 




+++++ 


347 


gbs2018 


544 


563 


++ 


+++++ 


364 




31 


51 


+ 


+++ 






107 


A Ad 

iiy 


+ 








QQQ 

oyy 


417 


++ 


4.4.4-4. 








519 


+ 


4.4,4.4. 




gusuu i z 




198 


++ 


+ 


218 


gosuu i o 


90 


OK 

oo 


+ 


++ 


219 


gbs0031 


•1 A Q 
11 O 


ZU1 




4.4.4.4. 


991 


gbs0428 


A O 

4o 


1oZ 


+ 


4.4.4.4.4. 


949 


gbs0538 


A A O 

118 


A OC 

1oO 


+ 


4.0.4.4. 

TTTT 


94Q 


gbs0580 


162 


17o 


+ 


1 

f 


ZOO 


gbs0628 


347 


364 


+ 


4.4.4.4.4. 


Z04 


gbs0632 


699 


715 


+ 


4.4.4.4.4. 


255 


gbs0672 


50 


76 




+ 


258 


gbs0918 


785 


819 


+ 


4.4.4.4.4. 


269 




44 


128 


++ 






gbs0971 


90 


128 




4.4.4.4.4. 


0~7A 


gbs1087 


314 


384 


+ 




zoy 


gbs1143 


327 


349 




4.4.4. 




gbs1306 


242 


n a A 

314 


++ 


4.4.4.4. 


*304 






47 ft 


++ 








23 


100 


+ 






yuo i ou / 


129 


210 


++ 




305 






188 




4.4. 


307 


nhol ^R9 


750 


772 


++ 


4.4.4.4.4. 


310 






56 




+ + 


335 


nhc-l ftft9 

gos i doz 


^99 


00 / 


+ 


4.4.4.4.4. 


337 


yuo iui u 


72 


90 


+ 


4.4.4.4.4. 


339 


gbs1784 


374 


395 




+ 


345 


gbs1816 


i3e 


15S 


+ 


+ 4.4.4. 


348 


gbs1899 


141 


164 


+ 




358 


gbs1925 


9e 


157 




+ 


361 


gbs2008 


1 


82 




+ 


363 


gbs2018 


48£ 


) 55e 


> + 


4.4.4.4.4. 


364 


gbs2106 


15€ 


) 18c 




++ 


371 




4c 


) 13c 


J + 


4,4.4.4.4. 





WO 2004/099242 

-88- 

Table 5: Immunogenic proteins identified by bacterial surface display. 



PCT/EP2004/004856 



S. agalactiae 
antigenic 
protein 


Putative function 
(by homology) 


predicted immunogenic aa** 


No. of 
selected 
clones per 
ORF and 
screen 


Location of 
identified 
immunogenic 
region (aa) 


Seq. 
ID 

(DNA, 
Prot.) 


gbs0233 


similar to glycine 
betaine/carnitine/choli 
ne ABC transporter 
(osmoprotectant- 
binding protein) 


4-26, 35-41, 53-61, 73-84, 103-108, 114-120, 140-146, 
156-162, 192-208, 214-219, 227-233, 239-252, 260-268, 
284-297 


A:5, B:9, 
H:3 


1-48, 113-133 


463, 475 


gbs0419 


protein of unknown 
function/lipoprotein 


4-27, 38-44, 50-56, 59-64, 72-79, 83-89, 92-97, 108-116, 
123-148, 152-167, 183-196, 200-220, 232-244, 255-261, 
265-274, 282-302, 309-317 


C:3, D:4, 
r:2,K:4 | 


1-79, 231-302 


464, 476 


gbs0456 


cell wall surface 1 
anchor family protein 


6-28, 66-72, 85-105, 115-121, 144-151, 160-170, 176- 
185, 223-230, 252-288, 296-310, 319-333, 367-374, 458- 
464, 471-480, 483-488, 520-528, 530-549, 559-564, 593- 
601, 606-616, 636-643, 655-662, 676-682, 684-699, 719- 
726, 735-750, 757-764, 777-785, 799-810, 812-843, 846- 
853, 868-873, 880-889, 891-899, 909-929, 934-940, 963- 
969, 998-1004, 1007-1014, 1016-1022, 1030-1046 


A:9, B:125, 
F:9 


1-80, 808-821 


465, 477 


gbs0942 


protein of unknown 
function/lipoprotein 


7-24, 35-41, 75-81, 91-114, 122-132, 137-144, 148-156, 
183-192, 194-200, 212-228, 233-238, 251-258, 275-295, 
326-332, 337-346 


A:3, B:9, 
C:2,J:3 


1-79, 305-321 


466, 478 


gbs0973 


gid protein (gid) 


31-38, 42-52, 66-72, 86-92, 98-104, 115-122, 127-146, 
154-164, 169-187, 198-212, 225-237, 255-269 


D:2, G:3, 

H:8,I:2, 

K:2 


13-92, 135-142 


467, 479 


gbs0975 


Unknown protein 


4-36, 39-49, 63-69, 71-77, 81-88, 123-131, 133-139, 160- 
169, 174-180, 188-194, 210-217, 273-278, 289-300, 317- 
334, 336-341, 383-401, 425-438 


A:3, B:5, 
C:5,D:2, 
1:2 


1-68, 212-270, 
402-446 


468, 480 


gbsl038 


permease, putative 


21-29, 31-42, 49-63, 72-79, 81-93, 112-132, 159-165, 
188-195, 197-232, 262-267, 279-286, 294-301, 318-326, 
348-366, 381-405, 409-426, 436-465, 471-480, 484-492, 
497-505, 521-544, 554-561, 567-577, 581-589, 601-609, 
611-622, 636-651, 653-667, 669-685, 700-708, 716-722, 
729-744, 749-766, 780-786, 789-811, 814-864 


B:2, C:3, 
E:3,J:2 


1-57, 84-106 


469, 481 


gbsll44 


Unknown protein 


6-24, 35-48, 57-63, 72-78, 87-92, 113-119, 123-137, 147- 
153, 173-181, 212-233 


C:30 


1-124 


470, 482 


gbsl279 


conserved domain 
protein 


13-34, 62-69, 78-83, 86-91, 98-104, 107-115, 146-159, 
179-188, 195-205, 209-221, 226-233, 239-253, 276-282, 
284-294, 297-308, 331-354, 375-382, 388-399, 421-433, 


A:44, B:2, 
C:4 


1-84 


471, 483 
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S. agalactiae 
antigenic 
protein 


Putative function 
(by homology) 


predicted immunogenic aa** 


No. of 
selected 
clones per 
ORF and 

screen 


Location of 
identified \ 
immunogenic 
region (aa) 


Seq. 

ID 
(DNA, 
Prot.) 






449-458, 464-469, 472-491, 508-513, 525-531, 534-550, 
575-593, 601-618, 629-635, 654-661, 666-680, 706-721, 
723-740, 771-805, 810-830, 845-851 








gbsl441 | 


conserved 

hypothetical protein 


4-32, 45-64, 73-83, 86-92, 100-111, 125-147, 157-163, 
170-175, 177-188, 226-232, 245-252, 258-274, 320-335, 
348-359 


A:2, B:8, 
C:2 


1-71 


472, 484 


gbsl677 


sodium transport 
family protein 


13-40, 43-71, 76-83, 87-101, 109-119, 125-156, 162-175, 
182-219, 226-232, 240-262, 270-287, 306-318, 326-342, 
344-408, 414-444, 449-456 


B:3, C:2, 
K:5 


1-51 


473, 485 


gbs2093 


Membrane protein, 
putative 


4-16, 18-34, 45-54, 99-108, 134-140, 203-212, 241-257, 
266-274, 279-291, 308-315, 330-336, 355-370, 374-382, 
402-410, 428-455, 466-472, 474-480, 531-554, 560-566, 
572-580, 597-618, 632-660, 664-674, 676-685, 691-705, 
708-735, 750-768 


B :5, C :8, 
D:2 


1-87, 342-480 


474, 486 



Table 6. Immune reactivity of epitopes within identified antigens with human sera 



Peptides 


Positivity 


aa from 


aa to 


Seq ID 


gbs0233.1 




115 


132 


475 


gbs0233.2 


+ 


1 


26 


475 


gbs0419.1 


++ 


33 


55 


476 


gbs0942.2 


+++ 


1 


25 


478 


gbs0973.1 


+ 


37 


61 


479 


gbs0975.1 


+ 


1 


24 


480 


gbsl 038.1 


+ 


1 


23 


481 


gbs 1144.3 




46 


60 


482 


gbsl 279.1 


+++ 


1 


28 


483 


gbsl 279.2 


+ 


23 


50 


483 


gbs1279.3 




45 


71 


483 


gbsl 441.1 


+ 


1 


22 


484 


gbs1441.2 


++ 


17 


38 


484 


gbsl 677.1 




1 


22 


485 


gbsl 677.2 


+ 


17 


j 38 


485 


gbs2093.1 


+ 


1 


27 


486 


gbs2093.2 




22 


! 47 


486 


gbs2093.4 


+ 


I 422 


447 


486 



WO 2004/099242 



PCT/EP2004/004856 



-90- 



Table 7. Immunogenicity of epitopes in mice 



ORF 


aa from 


aa to 


Peptide 
ELISA 


Seq ID 


gbs0233 ; 


115 


132 


++++ 


475 




1 


47 


++ 






1 


55 


++++ 


476 


gbs0456 


22 


85 


++ 


477 


gbs0942 


307 


320 


+ 


478 




1 


44 


++ 




gbs0973 


15 


| 76 


++ 


479 




40 


92 


+++ 




gbs0975 


1 


59 


++ 


480 




213 


269 


+ 






403 


445 


+ 




gbs1038 


! 1 


i 56 


++++ 


481 




85 


105 


+ 




gbs1144 


37 


121 


+++++ 


482 


gbs1279 


1 


71 


+++++ 


483 


gbs1441 


I 1 


| 38 


+++ 


| 484 


gbs1677 


! 1 


38 


+++ 


485 




f! 1 


47 


+++ 


486 



WO 2004/099242 



PCT/EP2004/004856 



-91 - 

References 



Altschul, S., et al. (1990). Tournal of Molecular Biology 215: 403-10. 

Baiter, S. et al. In Gram positive pathogens ed. by Fischetti V.A. et al. AMS Press 2000, 154-160. 

Bennett, D., et al. fl995). T Mol Recognit 8: 52-8. 

Brodeur, B., et al. (2000). Infect Immun 68: 5610-8. 

Burnie, J., et al. (1998). T Antimicrob Chemother 41: 319-22. 

Campbell, J., et al. (2000). Obstet Gynecol 96: 498-503. 

Cheng, Q., et al. (2002). Infect Immun 70: 6409-15. 

Clackson, T„ et al. (1991). Nature 352: 624-8. 

Devereux, J., et al. (1984). Nucleic acids research 12: 387-95. 

Doherty, E., et al. (2001). Annu Rev Biophys Biomol Struct 30: 457-475. 

Eisenbraun, M., et al. (1993). DNA Cell Biol 12: 791-7. 

Etz, H., et al. (2001). T Bacteriol 183: 6924-35. 

Farley, M. (2001). Clin Infect Pis 33: 556-61. 

Ganz, T. (1999). Science 286: 420-421. 

Georgiou, G. (1997). Nature Biotechnology 15: 29-34. 

Glaser, P., et al. (2002). Mol Microbiol 45: 1499-513. 

Hashemzadeh-Bonehi, L., et al. (1998). Mol Microbiol 30: 676-678. 

Heinje, von G (1987) e.g. Sequence Analysis in Molecular Biology, Academic Press. 

Hemmer, B., et al. (1999). Nat Med 5: 1375-82. 

Hoe, N., et al. (2001). T Infect Pis 183: 633-9. 

Hornef, M v et al. (2002). Nat Immunol 3: 1033-40. 

Jackson, L., et al. (1995). Ann Intern Med 123: 415-20. 

Johanson, K., et al. (1995). T Biol Chem 270: 9459-71. 

Jones, P., et al. (1986). Nature 321: 522-5. 

Kajava, A., et al. (2000). T Bacteriol 182: 2163-9. 

Kohler, G., et al. (1975). Nature 256: 495-7. 

Larsson, C, et al. (1999). Vaccine 17: 454-8. 

Lewin, A., et al. (2001). Trends Mol Med 7: 221-8. 

Marks, J., et al. (1992). Biotechnology (N 10: 779-83. 

McCafferty, J., et al. (1990). Nature 348: 552-4. 

Michel, J., et al. (1991). Infect Immun 59: 2023-8. 

Navarre, W., et al. (1999). Microbiol Mol Biol Rev 63: 174-229. 

Nizet, V. & Rubens, C.E. in Gram positive pathogens ed. by Fischetti V.A. et al. ASM Press 2000, 
pp 125-136. 

Okano, H., et al. (1991). T Neurochem 56: 560-7. 

Oligodeoxynucleotides as Antisense Inhibitors of Gene Expression; CRC Press, Boca Ration, FL (1988). 
Paoletti, L., et al. (2002). Semin Neonatol 7: 315-23. 

Paoletti, L., et al. In Gram positive pathogens ed. by Fischetti V.A. et al. ASM Press 2000, pp 137-153. 

Phillips-Quagliata, J., et al. (2000). T Immunol 165: 2544-55. 

Rammensee, H., et al. (1999). Immunogenetics 50: 213-9. 

Seeger, C, et al. (1984). Proc Natl Acad Sci U S A 81: 5849-52. 

Shibuya, A., et al. (2000). Nature Immunology 1: 441-6. 

Skerra, A. (1994). Gene 151: 131-5. 

Tang, P., et al. (1992). Nature 356: 152-4. 

Tempest, P., et al. (1991). Biotechnology (N Y) 9: 266-71. 

Tettelin, H„ et al. (2002). Proc Natl Acad Sci US A 99: 12391-6. 

Tourdot, S., et al. (2000). Eur T Immunol 30: 3411-21. 

Wiley, J., et al. (1987) Current Protocols in Molecular Biology. 



WO 2004/099242 



PCT/EP2004/004856 



-92- 

Claims: 

1. An isolated nucleic acid molecule encoding a hyperimmune serum reactive antigen or a fragment 
thereof comprising a nucleic acid sequence, which is selected from the group consisting of: 

a) a nucleic acid molecule having at least 70% sequence identity to a nucleic acid molecule selected 

from Seq ID No 14, 90, 157-216, 
h) a nucleic acid molecule which is complementary to the nucleic acid molecule of a), 

c) a nucleic acid molecule comprising at least 15 sequential bases of the nucleic acid molecule of a) 
or b) 

d) a nucleic acid molecule which anneals under stringent hybridisation conditions to the nucleic 
acid molecule of a), b), or c) 

e) a nucleic acid molecule which, but for the degeneracy of the genetic code, would hybridise to the 
nucleic acid molecule defined in a), b), c) or d). 

2. The isolated nucleic acid molecule according to claim 1, wherein the sequence identity is at least 
80%, preferably at least 95%, especially 100%. 

3. An isolated nucleic acid molecule encoding a hyperimmune serum reactive antigen or a fragment 
thereof comprising a nucleic acid sequence selected from the group consisting of 

a) a nucleic acid molecule having 96 % or more than 96 %, preferably at least 98 %, especially 100 % 
sequence identity to a nucleic acid molecule selected from Seq ID No 1, 3, 5-13, 15, 18-25, 27-31, 
33-36, 39-68, 70-85, 92-100, 103-126, 128-145, 147, 149-156, 217, 435-448 and 463-474. 

b) a nucleic acid molecule which is complementary to the nucleic acid molecule of a), 

c) a nucleic acid molecule comprising at least 15 sequential bases of the nucleic acid molecule of a) 
or b) 

d) a nucleic acid molecule which anneals under stringent hybridisation conditions to the nucleic 
acid molecule of a), b) or c), 

e) a nucleic acid molecule which, but for the degeneracy of the genetic code, would hybridise to the 
nucleic acid defined in a), b), c) or d). 

4. An isolated nucleic acid molecule encoding a hyperimmune serum reactive antigen or a fragment 
thereof comprising a nucleic acid sequence selected from the group consisting of 

a) a nucleic acid molecule having 98 % or more than 98%, escpecially 100 % sequence identity to a 
nucleic acid molecule selected from Seq ID No 32, 86, 91, 101, 127, 

b) a nucleic acid molecule which is complementary to the nucleic acid of a), 

c) a nucleic acid molecule which, but for the degeneracy of the genetic code, would hybridise to the 
nucleic acid defined in a), b), c) or d). 

5. The nucleic acid molecule according to any one of the claims 1, 2, 3 or 4, wherein the nucleic acid is 
DNA. 

6. The nucleic acid molecule according to any one of the claims 1,2, 3, 4, or 5 wherein the nucleic acid 
is RNA. 

7. An isolated nucleic acid molecule according to any one of claims 1 to 5, wherein the nucleic acid 
molecule is isolated from a genomic DNA, especially from a S. agalactiae genomic DNA. 

8. A vector comprising a nucleic acid molecule according to any one of claims 1 to 7. 

9. A vector according to claim 8, wherein the vector is adapted for recombinant expression of the 
hyperimmune serum reactive antigens or fragment thereof encoded by the nucleic acid molecule 
according to any one of claims 1 to 7. 
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10. A host cell comprising the vector according to claim 8 or 9. 

11. A hyperimmune serum-reactive antigen comprising an amino acid sequence being encoded by a 
nucleic acid molecule according to any one of the claims 1, 2, 5, 6 or 7 and fragments thereof, 
preferably the amino acid sequence is selected from the group consisting of Seq ID No 231, 307, 
374-433. 

12. A hyperimmune serum-reactive antigen comprising an amino acid sequence being encoded by a 
nucleic acid molecule according to any one of the claims 3, 5, 6, or 7 and fragments thereof, 
preferably the amino acid sueqnece is selected from the group consisting of Seq ID No 218, 220, 
222-230, 232, 235-242, 244-248, 250-253, 256-285, 287-302, 309-317, 320-343, 345-362, 364, 366-373, 
434, 449-462 and 475-486. 

13. A hyperimmune serum-reactive antigen comprising an amino acid sequence being encoded by a 
nucleic acid molecule according to any one of the claims 4, 5, 6, or 7 and fragments thereof, 
preferably the amino acid sequence is selected from the group consisting of Seq ID No 249, 303, 
308, 318, 344. 



14. Fragments of hyperimmune serum-reactive antigens selected from the group consisting of peptides 
comprising amino acid sequences of column "predicted immunogenic aa" and "location of 
identified immunogenic region" of Table 1A, especially peptides comprising amino acid 4-20, 35- 
44, 65-70, 73-87, 92-98, 112-137, 152-161, 177-186, 193-200, 206-213, 229-255, 282-294, 308-313, 320- 
326, 349-355, 373-384, 388-406, 420-425 and 115-199 of Seq ID No 218; 5-24, 35-41, 44-70, 73-89, 103- 
109, 127-143, 155-161, 185-190, 192-207, 212-219, 246-262, 304-336, 372-382, 384-393, 398-407, 412-418, 
438-444, 1-75, 76-161 and 164-239 of Seq ID No 219; 4-10, 16-58, 60-71, 77-92, 100-126, 132-146, 149- 
164, 166-172, 190-209, 214-220, 223-229, 241-256, 297-312, 314-319, 337-343, 351-359, 378-387, 398-418, 
421-428, 430-437, 440-448, 462-471, 510-519, 525-536, 552-559, 561-568, 573-582, 596-602, 608-630, 
637-649, 651-665, 681-702, 714-732, 739-745, 757-778, 790-805, 807-815, 821-829, 836-842, 846-873, 
880-903, 908-914, 916-923, 931-940, 943-948, 956-970, 975-986, 996-1015, 1031-1040, 1051-1069, 1072- 
1095, 1114-1119, 1130-1148, 1150-1157, 1169-1176, 1229-1238 and 802-812 of Seq ID No 220; 5-12, 14- 
26, 35-47, 52-67, 72-78, 83-98, 121-141, 152-159, 163-183, 186-207, 209-257, 264-277, 282-299, 301-309, 
312-318, 324-339, 358-368, 372-378, 387-397, 425-431 and 46-291 of Seq ID No 221; 29-38, 44-64, 70- 
76, 78-87, 94-100, 102-112, 119-134, 140-149, 163-173, 178-186, 188-194, 207-234, 247-262, 269-290 and 
73-92 of Seq ID No 222; 10-28, 36-63, 77-87, 103-119, 127-136, 141-169, 171-183, 195-200, 207-232, 
236-246, 251-265, 268-283, 287-297, 314-322, 335-343, 354-363, 384-390, 405-411, 419-436, 443-455, 
467-473, 480-513, 518-529, 550-557, 565-585, 602-608, 616-625, 632-660, 665-677, 685-701, 726-736, 
738-747, 752-761, 785-796, 801-813, 838-853, 866-871 and 757-774 of Seq ID No 223; 31-38, 61-66, 74- 
81, 90-115, 123-145, 154-167, 169-179, 182-193, 200-206, 238-244, 267-272 and 235-251 of Seq ID No 
224; 19-25, 38-54, 56-64, 66-72, 74-92, 94-100, 116-129, 143-149, 156-183, 204-232, 253-266, 269-275, 
294-307 and 241-313 of Seq ID No 225; 5-34, 50-56, 60-65, 74-85, 89-97, 108-119, 159-165, 181-199, 
209-225, 230-240, 245-251, 257-262, 274-282, 300-305 and 64-75 of Seq ID No 226; 5-13, 16-21, 27-42, 
45-52, 58-66, 74-87, 108-114, 119-131 and 39-51 of Seq ID No 227; 6-23, 46-54, 59-65, 78-84, 100-120, 
128-133, 140-146, 159-165, 171-183, 190-204, 224-232, 240-248, 250-259, 274-280, 288-296, 306-315 and 
267-274 of Seq ID No 228; 5-12, 15-24, 26-36, 42-65, 68-80, 82-104, 111-116, 125-144, 159-167, 184-189, 
209-218, 235-243, 254-265, 269-283, 287-300, 306-316, 318-336, 338-352, 374-392 and 162-174 of Seq 
ID No 229; 30-42, 45-54 and 25-37 of Seq ID No 230; 10-30, 53-59, 86-95, 116-130, 132-147, 169-189, 
195-201, 212-221, 247-256, 258-265, 278-283, 291-298, 310-316, 329-339, 341-352, 360-367, 388-396, 
398-411, 416-432, 443-452, 460-466, 506-512, 515-521, 542-548 and 419-431 of Seq ID No 231; 4-27, 
30-53, 60-67, 70-90, 92-151, 159-185, 189-195, 198-210, 215-239 and 173-189 of Seq ID No 232; 4-26, 
41-54, 71-78, 116-127, 140-149, 151-158, 161-175, 190-196, 201-208, 220-226, 240-252, 266-281, 298-305, 
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308-318, 321-329, 344-353, 372-378, 384-405, 418-426, 429-442, 457-463, 494-505, 514-522 and 174-188 
of Seq ID No 233; 17-25, 27-39, 61-67, 81-89, 99-110, 120-131, 133-139, 147-161, 167-172, 179-185, 192- 
198, 203-213, 226-238, 243-258, 261-267, 284-290, 296-307, 311-328, 340-352, 356-371 and 239-256 of 
Seq ID No 234; 8-30, 40-49, 67-80, 114-123, 126-142, 152-162, 188-194 and 57-70 of Seq ID No 235; 4- 
23, 28-34, 36-47, 50-61, 76-81, 89-94, 96-104, 112-119, 126-146, 155-181, 195-200, 208-214, 220-229, 244- 
260, 263-276, 282-288, 292-300, 317-323, 336-351, 353-359, 363-375, 382-399, 415-432, 444-455, 458-471, 
476-481, 484-492, 499-517, 522-529, 535-541, 543-568, 572-584, 586-600, 607-617, 626-637, 656-675 and 
282-297 of Seq ID No 236; 6-24, 30-35, 38-45, 63-91, 134-140, 146-160, 167-188, 214-220, 226-234, 244- 
250, 260-270, 286-301, 316-329, 340-371, 429-446, 448-459, 474-481, 485-491, 512-526, 537-544, 550-565, 
573-583, 596-613, 621-630, 652-658 and 87-97 of Seq ID No 237; 8-20, 26-48, 56-67, 76-86, 94-109, 115- 
121, 123-129, 143-160, 178-186, 191-198, 201-208, 221-236, 238-244, 260-268 and 237-247 of Seq ID No 
238; 4-40, 42-57, 73-87, 98-117, 126-135, 150-156, 166-174, 196-217, 231-236, 248-258, 276-284, 293-301, 
307-313, 339-347, 359-365, 375-387, 395-402, 428-440, 445-456, 485-490, 497-505, 535-541, 547-555, 
610-625, 648-656, 665-671 and 448-528 of Seq ID No 239; 10-18, 39-45, 51-61, 80-96, 98-106, 110-115, 
158-172, 174-183, 191-200, 220-237, 249-255, 274-289, 308-324, 331-341, 372-381, 384-397, 405-414 and 
322-338 of Seq ID No 240; 30-36, 38-56, 85-108, 134-147, 149-160, 163-183, 188-201, 206-211, 219-238, 
247-254 and 5-13 of Seq ID No 241; 11-40, 98-103, 110-115, 133-145, 151-159, 172-179, 192-201, 204- 
212, 222-228, 235-245, 258-268, 283-296, 298-309, 322-329, 342-351, 354-362, 372-378, 385-393, 407-418, 
495-516 and 1-148 of Seq ID No 242; 5-19, 21-36, 73-94, 112-119, 122-137, 139-145, 152-167, 184-190, 
198-204, 208-224, 249-265, 267-281, 299-304, 309-317, 326-333, 356-364, 368-374, 381-389, 391-414, 
419-425, 430-435 and 113-140 of Seq ID No 243; 45-54, 59-67, 78-91 and 15-23 of Seq ID No 244; 11- 
22, 33-47, 52-80, 88-112, 124-129 and 6-25 of Seq ID No 245; 26-41, 51-63, 80-89, 93-115, 150-163, 187- 
193, 220-237, 240-249, 286-294, 296-306, 316-329, 345-353, 361-370, 407-425, 428-437, 474-482, 484-494, 
504-517, 533-541, 549-558, 595-613, 616-625, 660-668, 673-685, 711-726, 736-744, 749-761, 787-802, 
812-820, 825-837, 863-878, 888-896, 901-913, 939-954, 964-972, 977-989, 1003-1008, 1016-1022, 1028- 
1034, 1041-1053, 1059-1074, 1101-1122, 420-511 and 581-704 of Seq ID No 246; 18-25, 27-55, 71-83, 
89-95, 102-113, 120-146, 150-156, 174-185 and 159-175 of Seq ID No 247; 24-30, 38-56, 63-68, 87-93, 
136-142, 153-164, 183-199, 213-219, 226-234, 244-261, 269-278, 283-289, 291-297, 320-328, 330-336, 
340-346, 348-356, 358-366, 382-387, 401-408, 414-419, 449-455, 468-491, 504-512, 531-537, 554-560, 
597-608, 621-627, 632-643, 650-662, 667-692, 703-716, 724-737, 743-758, 783-794, 800-818, 846-856 and 
806-884 of Seq ID No 248; 4-14, 21-39, 86-92, 99-107, 121-131, 136-144, 147-154, 158-166, 176-185, 
193-199, 207-222, 224-230 and 117-136 of Seq ID No 249; 65-76, 85-97, 103-109, 115-121, 125-146, 
163-169, 196-205, 212-219, 228-237, 241-247, 254-262, 269-288, 294-303, 305-313, 328-367, 395-401, 
405-412, 418-429, 437-447, 481-488, 506-513, 519-524, 530-541, 546-557 and 266-284 of Seq ID No 250; 
5-14, 37-42, 49-71, 78-92, 97-112, 127-136, 147-154, 156-163, 186-198, 216-225, 233-243, 248-253, 295- 
307, 323-332, 359-366, 368-374, 380-398 and 194-223 of Seq ID No 251; 4-11, 33-39, 45-72, 100-113, 
119-129, 136-144, 169-175, 177-185, 200-208, 210-219, 262-276, 278-297, 320-326, 336-344, 347-362, 
381-394, 443-453 and 438-454 of Seq ID No 252; 4-29, 31-52, 55-61, 95-110, 138-158, 162-171, 179-187, 
202-229, 239-248, 251-256, 262-267, 269-285, 304-310, 351-360, 362-368, 381-388, 415-428, 435-440, 
448-458 and 161-178 of Seq ID No 253; 4-17, 19-28, 32-43, 47-59, 89-110, 112-126, 128-134, 140-148, 
152-161, 169-184, 191-204, 230-235, 255-264, 328-338, 341-347, 401-409, 413-419, 433-441, 449-458, 
463-468, 476-482, 486-492, 500-506, 529-545 and 305-381 of Seq ID No 254; 10-29, 38-45, 53-61, 134- 
145, 152-160, 163-170, 202-208, 219-229, 248-258, 266-275, 282-288, 315-320, 328-334, 377-385, 392-402, 
418-424, 447-453, 460-471, 479-487, 491-497, 500-507, 531-537, 581-594, 615-623, 629-635, 644-652, 
659-666, 668-678, 710-717, 719-728, 736-741, 747-760, 766-773, 784-789, 794-800, 805-817, 855-861, 
866-887 and 698-715 of Seq ID No 255; 16-26, 29-37, 44-58, 62-68, 74-80, 88-95, 97-120, 125-144, 165- 
196 and 58-72 of Seq ID No 256; 14-21, 23-46, 49-60, 63-74, 78-92, 96-103, 117-129, 134-161, 169-211, 
217-231, 239-248, 252-281, 292-299, 313-343 and 243-257 of Seq ID No 257; 11-27, 46-52, 67-72, 76-84, 
91-112, 116-153, 160-175, 187-196, 202-211, 213-220 and 43-76 of Seq ID No 258; 5-29, 37-56, 78-86, 
108-118, 152-161 and 120-130 of Seq ID No 259; 8-14, 19-41, 52-66, 75-82, 87-92, 106-121, 127-133, 
136-143, 158-175, 180-187, 196-204, 221-228, 239-245, 259-265, 291-306, 318-323, 328-340, 352-358, 
361-368, 375-381, 391-399, 411-418, 431-442, 446-455, 484-496, 498-510, 527-533, 541-549, 558-565, 
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575-585, 587-594, 644-655, 661-668, 671-677 and 184-196 of Seq ID No 260; 4-22, 29-38, 55-62, 75-81, 
102-107, 110-134, 143-150, 161-167, 172-179, 191-215, 223-233, 241-247, 251-264, 266-272, 288-309, 
340-352, 354-366, 394-402, 414-438 and 198-218 of Seq ID No 261; 24-44, 49-70, 80-91, 105-118, 128- 
136, 140-154 and 77-92 of Seq ID No 262; 5-22, 31-36, 41-47, 67-74, 83-90, 105-122, 135-143, 160-167 
and 118-129 of Seq ID No 263; 4-25, 33-73, 81-93, 96-106, 114-120, 122-128, 130-172, 179-208, 210- 

241, 251-283, 296-301 and 92-100 of Seq ID No 264; 14-24, 29-38, 43-50, 52-72, 86-97, 101-107, 110- 
125, 127-141, 145-157, 168-175, 177-184, 186-195, 205-226, 238-250, 255-261, 284-290, 293-304, 307-314, 
316-323, 325-356, 363-371, 383-390, 405-415, 423-432, 442-454, 466-485, 502-511, 519-527, 535-556, 
558-565, 569-574, 612-634, 641-655, 672-686, 698-709, 715-722, 724-732, 743-753, 760-769, 783-792, 
818-825, 830-839, 842-849, 884-896, 905-918, 926-940, 957-969, 979-1007, 1015-1021, 1049-1057 and 
336-349 of Seq ID No 265; 6-16, 26-31, 33-39, 62-73, 75-85, 87-100, 113-123, 127-152, 157-164, 168-181, 
191-198, 208-214, 219-226, 233-254, 259-266, 286-329 and 181-195 of Seq ID No 266; 4-13, 32-39, 53- 
76, 99-108, 110-116, 124-135, 137-146, 149-157, 162-174, 182-190, 207-231, 242-253, 255-264, 274-283, 
291-323, 334-345, 351-360, 375-388, 418-425, 456-474, 486-492, 508-517, 520-536, 547-560, 562-577, 31- 
45 and 419-443 of Seq ID No 267; 15-26, 30-37, 42-49, 58-90, 93-99, 128-134, 147-154, 174-179, 190- 
197, 199-205, 221-230, 262-274, 277-287, 300-314, 327-333, 343-351, 359-377, 388-396, 408-413, 416-425, 
431-446 and 246-256 of Seq ID No 268; 5-26, 34-42, 47-54, 61-67, 71-104, 107-115, 131-138, 144-153, 
157-189, 196-202, 204-210, 228-245, 288-309, 316-329, 332-341, 379-386, 393-399, 404-412, 414-421, 
457-468, 483-489, 500-506, 508-517, 523-534, 543-557, 565-580, 587-605, 609-617, 619-627, 631-636, 
640-646, 662-668, 675-682, 705-710, 716-723, 727-732, 750-758, 784-789, 795-809, 869-874, 14-138, 166- 
286, 372-503, 674-696 and 754-859 of Seq ID No 269; 5-17, 32-38, 40-47, 80-89, 113-119, 125-137, 140- 
154, 157-163, 170-177, 185-199, 213-225, 228-236, 242-248, 277-290, 292-305, 323-333, 347-353, 364-370, 
385-394, 399-406, 423-433, 441-451, 462-474, 477-487 and 116-124 of Seq ID No 270; 7-16, 18-30, 32- 
49, 53-61, 63-85, 95-101, 105-115, 119-134, 143-150, 159-178, 185-202, 212-229, 236-250, 254-265, 268- 
294 and 63-72 of Seq ID No 271; 4-12, 19-47, 73-81, 97-103, 153-169, 188-198, 207-213, 217-223, 236- 

242, 255-265, 270-278, 298-305, 309-317, 335-347, 354-363, 373-394, 419-424, 442-465, 486-492, 500-507, 
542-549, 551-558, 560-572, 580-589, 607-614, 617-623, 647-653, 666-676, 694-704, 706-714, 748-754, 
765-772, 786-792, 795-806 and 358-370 of Seq ID No 272; 18-28, 30-38, 40-46, 49-55, 69-78, 82-98, 104- 
134, 147-153, 180-190, 196-202, 218-236, 244-261, 266-273, 275-286, 290-295, 301-314, 378-387, 390-395, 
427-434 and 290-305 of Seq ID No 273; 4-13, 20-31, 39-51, 54-61, 69-84, 87-105, 117-124 and 108-125 
of Seq ID No 274; 24-34, 43-54, 56-66, 68-79 and 50-69 of Seq ID No 275; 5-43, 71-77, 102-131, 141- 
148, 150-156, 159-186, 191-207, 209-234, 255-268, 280-286, 293-299, 317-323, 350-357, 363-372, 391-397, 
406-418, 428-435, 455-465, 484-497, 499-505, 525-531, 575-582, 593-607, 621-633, 638-649, 655-673, 
684-698, 711-725, 736-741, 743-752, 759-769, 781-793, 813-831, 843-853, 894-905, 908-916, 929-946, 
953-963, 970-978, 1001-1007, 1011-1033, 165-178 and 818-974 of Seq ID No 276; 16-44, 63-86, 98-108, 
185-191, 222-237, 261-274, 282-294, 335-345, 349-362, 374-384, 409-420, 424-430, 440-447, 453-460, 
465-473, 475-504, 522-534, 538-551, 554-560, 567-582, 598-607, 611-619, 627-640, 643-653, 655-661, 
669-680, 684-690, 701-707, 715-731, 744-750, 756-763, 768-804, 829-837, 845-853, 855-879, 884-890, 
910-928, 77-90, 144-212, 279-355, 434-536, 782-810 and 875-902 of Seq ID No 277; 4-22, 29-41, 45-51, 
53-66, 70-77, 86-95, 98-104, 106-124, 129-135, 142-151, 153-161, 169-176, 228-251, 284-299, 331-337, 
339-370, 380-387, 393-398, 406-411, 423-433, 440-452, 461-469, 488-498, 501-516, 523-530, 532-559, 
562-567, 570-602, 612-628, 630-645, 649-659, 666-672, 677-696, 714-723, 727-747 and 212-227 of Seq 
ID No 278; 4-9, 17-31, 35-41, 56-61, 66-75, 81-87, 90-124, 133-138, 149-163, 173-192, 213-219, 221-262, 
265-275, 277-282, 292-298, 301-307, 333-346, 353-363, 371-378, 419-430, 435-448, 456-469, 551-570, 
583-599, 603-612 and 275-291 of Seq ID No 279; 28-34, 53-58, 72-81, 100-128, 145-154, 159-168, 172- 
189, 217-225, 227-249, 256-263, 299-309, 322-330, 361-379, 381-388, 392-401, 404-417, 425-436, 440-446, 
451-464, 469-487, 502-511, 543-551, 559-564, 595-601, 606-612, 615-626, 633-642, 644-650, 664-670, 
674-684, 692-701, 715-723, 726-734, 749-756, 763-771, 781-787, 810-843, 860-869, 882-889, 907-917, 
931-936, 941-948, 951-958, 964-971, 976-993, 1039-1049, 1051-1065, 1092-1121, 1126-1132, 1145-1151, 
1158-1173, 1181-1192, 1194-1208, 1218-1223, 1229-1243, 1249-1254, 1265-1279, 1287-1297, 1303-1320, 
1334-1341, 1343-1358, 1372-1382, 1406-1417, 1419-1425, 1428-1434, 1441-1448, 1460-1473, 1494-1504, 
1509-1514, 1529-1550, 654-669 and 1400-1483 of Seq ID No 280; 10-16, 20-25, 58-65, 97-109, 118-132, 
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134-146, 148-155, 186-195, 226-233, 244-262, 275-284, 295-310, 317-322, 330-339, 345-351, 366-375, 
392-403, 408-415, 423-430, 435-444, 446-457, 467-479, 486-499, 503-510, 525-537, 540-585, 602-612, 
614-623, 625-634, 639-645, 650-669, 700-707, 717-724, 727-739, 205-230 and 733-754 of Seq ID No 281; 
5-22, 37-43, 72-81, 105-113, 128-133, 148-160, 188-194, 204-230, 238-245, 251-257 and 194-213 of Seq 
ID No 282; 16-21, 35-41, 56-72, 74-92, 103-109 and 62-68 of Seq ID No 283; 4-15, 17-82, 90-104, 107- 
159, 163-170, 188-221, 234-245, 252-265 and 220-235 of Seq ID No 284; 16-22, 36-46, 61-75, 92-107, 
113-121, 139-145, 148-160 and 30-42 of Seq ID No 285; 4-12, 20-26, 43-49, 55-62, 66-78, 121-127, 135- 
141, 146-161, 164-170, 178-189, 196-205, 233-238, 269-279, 288-318, 325-332, 381-386, 400-407 and 328- 
346 of Seq ID No 286; 5-12, 31-49, 57-63, 69-79, 89-97, 99-114, 116-127, 134-142, 147-154, 160-173, 
185-193, 199-204, 211-222, 229-236, 243-249, 256-274 and 58-68 of Seq ID No 287; 10-20, 28-34, 39-53, 

68- 79, 84-90, 99-106 and 73-79 of Seq ID No 288; 14-37, 45-50, 61-66, 77-82, 93-98, 109-114, 125-130, 
141-146, 157-162, 173-178, 189-194, 205-210, 221-226, 237-242, 253-258, 269-274, 285-290, 301-306, 
316-332, 349-359, 371-378, 385-406, 34-307 and 312-385 of Seq ID No 289; 4-10, 17-38, 50-85, 93-99, 
109-116, 128-185, 189-197, 199-210, 223-256, 263-287, 289-312, 327-337, 371-386, 389-394, 406-419, 
424-432, 438-450, 458-463, 475-502, 507-513, 519-526, 535-542, 550-567 and 361-376 of Seq ID No 290; 
10-39, 42-93, 100-144, 155-176, 178-224, 230-244, 246-255, 273-282, 292-301, 308-325, 332-351, 356-361, 
368-379, 386-393, 400-421 and 138-155 of Seq ID No 291; 5-11, 17-34, 40-45, 50-55, 72-80, 101-123, 
145-151, 164-172, 182-187, 189-195, 208-218, 220-241, 243-252, 255-270, 325-331, 365-371, 391-398, 
402-418, 422-428, 430-435, 443-452, 463-469, 476-484, 486-494, 503-509, 529-553, 560-565, 570-590, 
608-614, 619-627, 654-661, 744-750, 772-780, 784-790, 806-816, 836-853, 876-885, 912-918, 926-933, 
961-975, 980-987, 996-1006, 1016-1028, 1043-1053, 1057-1062, 994-1003 and 1033-1056 of Seq ID No 
292; 17-45, 64-71, 73-81, 99-109, 186-192, 223-238, 262-275, 283-295, 336-346, 350-363, 375-385, 410- 
421, 425-431, 441-448, 454-463, 468-474, 476-512, 523-537, 539-552, 568-583, 599-608, 612-620, 628-641, 
644-654, 656-662, 670-681, 685-695, 702-708, 716-723, 725-735, 757-764:, 769-798, 800-806, 808-816, 
826-840, 846-854, 856-862, 874-881, 885-902, 907-928, 274-350 and 443-513 of Seq ID No 293; 4-22, 
29-41, 45-51, 53-61, 70-76, 85-92, 99-104, 111-122, 134-140, 142-154, 163-174, 224-232, 255-265, 273- 
279, 283-297, 330-335, 337-348, 356-367, 373-385, 391-396, 421-431, 442-455, 475-485, 493-505, 526-538, 
544-561, 587-599, 605-620, 622-651, 662-670, 675-681, 687-692, 697-712, 714-735 and 252-262 of Seq 
ID No 294; 4-12, 15-35, 40-46, 50-59, 67-94, 110-128, 143-169, 182-188, 207-215, 218-228, 238-250 and 
74-90 of Seq ID No 295; 9-18, 42-58, 78-85, 88-95, 97-106, 115-122, 128-134, 140-145, 154-181, 186-202, 
204-223, 261-267, 269-278, 284-293, 300-336, 358-368 and 12-29 of Seq ID No 296; 7-34, 46-53, 62-72, 
82-88, 100-105, 111-117, 132-137, 144-160, 166-180, 183-189, 209-221, 231-236, 246-253, 268-282, 286- 
293, 323-336, 364-372, 378-392, 422-433 and 388-405 of Seq ID No 297; 21-27, 34-50, 72-77, 80-95, 
164-177, 192-198, 202-220, 226-236, 239-247, 270-279, 285-292, 315-320, 327-334, 348-355, 364-371, 
388-397, 453-476, 488-497, 534-545, 556-576, 582-588, 601-607, 609-616, 642-662, 674-681, 687-697, 
709-715, 721-727, 741-755 and 621-739 of Seq ID No 298; 4-14, 16-77, 79-109 and 25-99 of Seq ID No 
299; 4-9, 17-23, 30-37, 44-55, 65-72, 77-93, 102-121, 123-132, 146-153 and 17-29 of Seq ID No 300; 4- 
18, 25-41, 52-60, 83-92, 104-112, 117-123, 149-155, 159-167, 170-192, 201-210, 220-227, 245-250 and 
124-137 of Seq ID No 301; 8-25, 50-55, 89-95, 138-143, 148-153, 159-169, 173-179, 223-238, 262-268, 
288-295, 297-308, 325-335, 403-409, 411-417, 432-446, 463-475, 492-501, 524-530, 542-548, 561-574, 
576-593, 604-609, 612-622, 637-654, 665-672, 678-685, 720-725, 731-739, 762-767, 777-783, 820-838, 
851-865, 901-908, 913-920, 958-970, 1000-1006, 1009-1015, 1020-1026, 1043-1052, 1055-1061, 1-128, 
252-341, 771-793 and 1043-1058 of Seq ID No 302; 16-26, 33-46 and 64-76 of Seq ID No 303; 4-27, 

69- 77, 79-101, 117-123, 126-142, 155-161, 171-186, 200-206, 213-231, 233-244, 267-273, 313-329, 335- 
344, 347-370, 374-379, 399-408, 422-443, 445-453, 461-468, 476-482, 518-534, 544-553, 556-567, 578-595, 
601-620, 626-636, 646-658, 666-681, 715-721, 762-768, 778-785, 789-803, 809-819, 22-108, 153-318, 391- 
527 and 638-757 of Seq ID No 304; 6-21, 32-43, 62-92, 104-123, 135-141, 145-152, 199-216, 218-226, 
237-247, 260-269, 274-283, 297-303, 1-72 and 127-211 of Seq ID No 305; 6-26, 50-56, 83-89, 108-114, 
123-131, 172-181, 194-200, 221-238, 241-247, 251-259, 263-271, 284-292, 304-319, 321-335, 353-358, 
384-391, 408-417, 424-430, 442-448, 459-466, 487-500, 514-528, 541-556, 572-578, 595-601, 605-613, 
620-631, 635-648, 660-670, 673-679, 686-693, 702-708, 716-725, 730-735, 749-755, 770-777, 805-811, 
831-837, 843-851, 854-860, 863-869, 895-901, 904-914, 922-929, 933-938, 947-952, 956-963, 1000-1005, 
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1008-1014, 1021-1030, 1097-1103, 1120-1130, 1132-1140, 1-213, 269-592 and 992-1120 of Seq ID No 
306; 9-16, 33-39, 47-59, 65-79, 81-95, 103-108, 115-123, 138-148, 163-171, 176-185, 191-196, 205-211, 
213-221, 224-256, 261-276, 294-302, 357-363, 384-390, 95-111 and 161-189 of Seq ID No 307; 21-27, 
35-45, 70-76, 92-105, 129-143, 145-155, 161-166, 170-191, 204-211, 214-231, 234-246, 249-255, 259-275 
and 1-18 of Seq ID No 308; 21-35, 45-53, 56-64, 69-97 and 1-16 of Seq ID No 309; 25-33, 41-47, 61-68, 
86-101, 106-114, 116-129, 134-142, 144-156, 163-176, 181-190, 228-251, 255-261, 276-292, 295-305, 334- 
357, 368-380, 395-410, 424-429, 454-460, 469-482, 510-516, 518-527, 531-546, 558-570, 579-606, 628-636, 
638-645, 651-656, 668-674, 691-698, 717-734, 742-754, 765-770, 792-797, 827-835, 847-859, 874-881, 
903-909, 926-933, 942-961, 964-977, 989-1004, 1010-1028, 1031-1047, 1057-1075, 1081-1095, 1108-1117, 
1138-1144, 1182-1189, 1193-1206, 1220-1229, 1239-1246, 1257-1267, 1271-1279, 1284-1301, 1312-1320, 
1329-1335, 1341-1347, 1358-1371, 1399-1404, 1417-1426, 1458-1463, 1468-1476, 1478-1485, 1493-1506, 
1535-1541, 1559-1574, 1583-1590, 1595-1601, 1603-1611, 1622-1628, 1634-1644, 1671-1685, 1689-1696, 
1715-1720, 1734-1746, 1766-1775, 1801-1806, 1838-1844, 1858-1871, 1910-1917, 1948-1955, 1960-1974, 
2000-2015, 2019-2036, 2041-2063, 748-847 and 1381-1391 of Seq ID No 310; 5-12, 18-24, 27-53, 56-63, 
96-113, 119-124, 131-136, 157-163, 203-209, 215-223, 233-246, 264-273, 311-316, 380-389, 393-399, 425- 
433, 445-450, 457-462, 464-470, 475-482, 507-513, 527-535, 542-548, 550-565, 591-602, 607-613, 627-642, 
644-664, 673-712, 714-732, 739-764, 769-782, 812-818, 826-838, 848-854, 860-871, 892-906, 930-938, 
940-954, 957-973, 990-998, 1002-1021, 1024-1033, 1037-1042, 1050-1060, 1077-1083, 1085-1092, 1100- 
1129, 1144-1161, 1169-1175, 1178-1189, 1192-1198, 1201-1207, 1211-1221, 1229-1239, 1250-1270, 1278- 
1292, 1294-1300, 1314-1335, 1344-1352, 1360-1374, 1394-1405, 1407-1414, 1416-1424, 1432-1452, 1456- 
1462, 1474-1497, 1500-1510, 1516-1522, 1534-1542, 1550-1559, 1584-1603, 1608-1627, 187-273 and 306- 
441 of Seq ID No 311; 70-80, 90-97, 118-125, 128-140, 142-148, 154-162, 189-202, 214-222, 224-232, 
254-260, 275-313, 317-332, 355-360, 392-398, 425-432, 448-456, 464-470, 476-482, 491-505, 521-528, 
533-546, 560-567, 592-597, 605-614, 618-626, 637-644, 646-653, 660-666, 677-691 and 207-227 of Seq 
ID No 312; 5-19, 26-34, 37-55, 57-66, 69-83, 86-102, 115-134, 138-143, 154-172, 178-195, 209-246, 251- 
257, 290-302, 306-311 and 256-266 of Seq ID No 313; 10-20, 22-28, 35-57, 72-79, 87-103, 108-128, 130- 
144, 158-171, 190-198, 225-242, 274-291, 301-315, 317-324, 374-385 and 353-365 of Seq ID No 314; 4-9, 
17-30, 34-54, 59-66, 73-94, 118-130, 135-150, 158-171, 189-198, 219-239, 269-275, 283-301, 89-106 and 
176-193 of Seq ID No 315; 14-20, 22-74, 77-86, 89-99, 104-109, 126-135, 154-165, 181-195, 197-212, 
216-224, 264-275 and 107-118 of Seq ID No 316; 4-18, 21-38, 63-72, 101-109, 156-162, 165-179, 183- 
192, 195-210, 212-218, 230-239, 241-256, 278-290, 299-311, 313-322, 332-341, 348-366, 386-401, 420-426, 
435-450, 455-460, 468-479, 491-498, 510-518, 532-538, 545-552, 557-563, 567-573, 586-595, 599-609, 
620-626, 628-636, 652-657, 665-681 and 1-198 of Seq ID No 317; 4-10, 16-38, 51-68, 73-79, 94-115, 120- 
125, 132-178, 201-208, 216-223, 238-266, 269-295, 297-304, 337-342, 347-356, 374-401, 403-422, 440-447, 
478-504, 510-516, 519-530, 537-544 and 191-206 of Seq ID No 318; 12-40, 42-48, 66-71, 77-86, 95-102, 
113-120, 129-137, 141-148, 155-174, 208-214, 218-225, 234-240, 256-267, 275-283, 300-306, 313-321, 
343-350, 359-367, 370-383, 398-405, 432-439, 443-461, 492-508, 516-526, 528-535 and 370-478 of Seq 
ID No 319; 6-14, 20-37, 56-62, 90-95, 97-113, 118-125, 140-145, 161-170, 183-202, 237-244, 275-284, 
286-305, 309-316, 333-359, 373-401, 405-412 and 176-187 of Seq ID No 320; 33-44, 50-55, 59-80, 86- 
101, 129-139, 147-153, 157-163, 171-176, 189-201, 203-224, 239-245, 257-262, 281-287, 290-297, 304-320, 
322-331, 334-350, 372-390, 396-401, 71-88 and 353-372 of Seq ID No 321; 5-11, 15-24, 26-33, 40-47, 75- 
88, 95-103, 105-112 and 17-30 of Seq ID No 322; 5-11, 16-39, 46-54, 62-82, 100-107, 111-124, 126-150, 
154-165, 167-183, 204-238, 245-295, 301-313, 316-335 and 8-16 of Seq ID No 323; 4-19, 34-48, 69-74, 
79-107, 115-127, 129-135, 143-153, 160-169, 171-182 and 142-153 of Seq ID No 324; 4-30, 65-74, 82- 
106, 110-120, 124-132, 135-140, 146-175, 179-184, 190-196, 217-223, 228-233, 250-267, 275-292, 303-315, 
322-332 and 174-186 of Seq ID No 325; 9-16, 29-41, 47-57, 68-84, 87-109, 113-119, 162-180, 186-193, 
195-201, 203-208, 218-230, 234-243, 265-271, 281-292, 305-312, 323-332, 341-347, 349-363, 368-374, 
383-390, 396-410, 434-440, 446-452, 455-464, 466-473, 515-522, 529-542, 565-570, 589-600, 602-613, 
618-623, 637-644, 1019-1027, 1238-1244, 1258-1264, 1268-1276, 1281-1292, 1296-1302 and 883-936 of 
Seq ID No 326; 10-17, 23-32, 39-44, 54-72, 75-81, 88-111, 138-154, 160-167, 178-185, 201-210, 236-252, 
327-334, 336-342, 366-376, 388-400, 410-430, 472-482, 493-526, 552-558, 586-592, 598-603, 612-621, 
630-635, 641-660 and 384-393 of Seq ID No 327; 4-22, 24-39, 50-59, 73-84, 100-105, 111-117, 130-138, 
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155-161, 173-178, 182-189, 205-215, 266-284, 308-313, 321-328, 330-337, 346-363, 368-374, 388-395, 
397-405, 426-434, 453-459, 482-492, 501-507, 509-515, 518-523, 527-544, 559-590, 598-612, 614-629, 
646-659, 663-684, 686-694, 698-721 and 445-461 of Seq ID No 328; 14-22, 27-33 and 3-17 of Seq ID 
No 329; 29-41, 66-73, 81-87, 90-108, 140-146, 150-159, 165-184, 186-196, 216-226, 230-238, 247-253, 
261-269 and 126-140 of Seq ID No 330; 5-12, 16-25, 27-33, 36-45, 60-68, 83-88, 103-126 and 86-101 of 
Seq ID No 331; 14-23, 36-47, 56-66, 84-89, 94-105, 111-127, 140-153, 160-174, 176-183, 189-203, 219- 
225, 231-237, 250-257 and 194-227 of Seq ID No 332; 4-25, 54-60, 64-71, 73-82, 89-106, 117-124, 157- 
169, 183-188, 199-210, 221-232, 236-244, 255-264 and 58-98 of Seq ID No 333; 13-19, 26-36, 41-53, 55- 
71, 77-84, 86-108, 114-135, 157-172, 177-183, 187-194, 208-213, 218-226, 110-125 and 156-170 of Seq 
ID No 334; 5-24, 63-69, 77-85, 94-112, 120-137, 140-146, 152-159, 166-172, 179-187, 193-199, 206-212, 
222-228, 234-240, 244-252, 257-264, 270-289, 298-309, 316-328, 337-348, 363-375, 1-56 and 340-352 of 
Seq ID No 335; 18-39, 42-71, 78-120, 124-144, 152-173, 179-189, 199-209, 213-222, 228-258, 269-304, 
329-361, 364-372, 374-389, 396-441 and 313-327 of Seq ID No 336; 19-25, 91-98, 108-120, 156-162, 
168-174, 191-204, 211-216, 232-266, 272-278, 286-308, 316-321, 327-333, 344-355, 358-364, 384-391, 
395-428, 464-476, 487-495, 497-511, 544-561, 563-573, 575-582, 588-594, 10-25 and 322-338 of Seq ID 
No 337; 14-26, 32-49, 51-57, 59-72, 80-91, 102-112, 119-125, 147-161, 164-173, 175-183, 188-213, 217- 
222, 246-254, 260-276, 282-303, 308-318, 321-328, 333-350, 352-359, 371-378, 392-401, 407-414, 416-443, 
448-463, 471-484, 490-497, 501-514, 519-527, 539-551, 557-570, 578-590, 592-598, 600-610, 618-629, 
633-647, 654-667, 676-689, 702-709, 718-726, 728-737, 741-760, 764-780, 786-795, 808-826, 836-842, 
845-852, 865-874, 881-887, 931-945, 949-957, 968-974, 979-986, 1003-1009, 1023-1029 and 90-103 of 
Seq ID No 338; 11-16, 37-56, 60-66, 69-77, 80-88, 93-106, 117-139, 166-171 and 72-90 of Seq ID No 
339; 59-84, 123-133, 145-150, 161-167, 178-189 and 115-128 of Seq ID No 340; 15-33, 39-46, 52-64, 74- 
87, 108-124, 127-144, 150-156, 173-179, 184-194, 201-208, 219-236, 243-269, 272-295, 302-309, 343-349, 
356-361, 370-379, 405-411, 414-423, 430-451, 457-464, 466-475, 477-483, 496-502, 507-522, 541-548, 
557-563, 571-577, 579-585, 590-605, 626-642, 650-662, 671-691, 704-710, 751-769, 775-781, 786-791, 
794-829, 851-858, 868-878, 884-904, 913-919, 931-939 and 132-142 of Seq ID No 341; 33-58, 64-71, 74- 
80, 83-88, 96-120, 122-139, 146-157, 167-177, 207-213, 220-225, 236-242, 264-279, 300-305, 326-336, 
340-347, 350-360, 97-115 and 199-211 of Seq ID No 342; 4-26, 43-57, 70-99, 102-117, 121-133, 142-148, 
151-168, 170-183, 192-220, 235-249, 258-279 and 30-41 of Seq ID No 343; 34-42, 48-58, 70-94, 110-130, 
154-160, 164-172, 178-183, 195-203, 211-222, 229-250, 256-261, 274-284, 286-292, 312-323 and 222-233 
of Seq ID No 344; 4-9, 15-36, 38-45, 49-74, 78-88, 100-112, 136-191, 211-220, 226-233, 239-246, 254- 
274, 287-307, 316-322, 342-353, 356-366, 373-378, 384-393, 405-431, 449-457, 459-468, 487-511, 515-524, 
529-541, 544-552, 562-568, 571-576 and 208-280 of Seq ID No 345; 10-27, 31-37, 39-54, 71-108, 124- 
143 and 2-107 of Seq ID No 346; 16-27, 38-57, 64-70, 90-102, 104-113, 116-137, 160-166 and 1-80 of 
Seq ID No 347; 13-21, 31-36, 56-67, 127-136, 153-171, 173-180, 184-200, 214-222, 225-231, 239-263, 
267-273 and 135-159 of Seq ID No 348; 12-27, 31-51, 68-74, 77-87 f 94-101, 108-114, 117-123, 127-134, 
138-168, 173-196, 201-207, 212-217, 227-237, 247-257, 264-280 and 205-223 of Seq ID No 349; 17-22, 
25-54, 70-76, 92-100 and 98-110 of Seq ID No 350; 7-29, 40-50, 60-67, 87-96, 105-111, 119-164, 172- 
199, 206-212, 220-227, 237-259, 272-279, 282-293, 295-309, 313-319, 321-328, 345-363, 376-386 and 159- 
176 of Seq ID No 351; 4-19, 24-30, 36-43, 50-68, 71-89, 93-106, 141-152, 154-172, 179-197, 199-215, 
229-239, 246-252, 255-263, 281-298, 319-325, 329-356, 358-368, 374-390, 397-409, 420-429, 432-444, 
450-456, 459-475, 483-494, 496-502, 520-528, 532-556 and 362-377 of Seq ID No 352; 18-25, 40-62, 77- 
85, 91-97, 105-116, 123-133, 139-184, 189-197 and 122-140 of Seq ID No 353; 4-49, 52-58, 62-70, 79- 
105, 109-133, 142-150, 163-168, 206-214, 220-228, 233-240, 243-254, 274-281, 303-311, 327-338, 357-373, 
378-396, 403-413, 420-436, 441-453, 461-467, 475-481, 484-498, 506-512, 514-521, 523-529, 562-579, 
589-595, 598-603, 615-648, 714-722, 728-742, 749-758, 777-792, 795-807 and 643-658 of Seq ID No 354; 
8-27, 37-48, 51-56, 72-79, 87-106, 120-138, 140-147, 167-176, 187-197, 205-216, 222-229, 234-239, 243- 
249, 277-288, 292-315, 334-343, 347-353, 363-391, 398-404, 430-447, 461-467, 478-492, 498-507 and 456- 
470 of Seq ID No 355; 5-12, 18-24, 59-69, 80-93, 95-109, 119-125, 130-137, 139-147, 158-163, 168-176, 
182-202, 206-215, 222-239, 241-249, 267-277, 291-298, 311-318, 321-327, 338-344, 348-355, 373-386, 
393-406, 411-417, 434-443, 446-465, 473-484, 514-521, 532-553, 584-594 and 221-237 of iSeq ID No 356; 
4-14, 27-34, 50-58, 63-72, 79-106, 109-114, 121-142, 146-154, 161-167, 169-175, 178-201, 223-238, 249- 
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254, 259-264, 278-292, 294-312, 319-330 and 167-191 of Seq ID No 357; 7-28, 36-42, 50-61, 63-80, 122- 
152, 161-174, 176-191 and 140-190 of Seq ID No 358; 20-57, 59-65, 70-78, 86-102, 119-133, 142-161, 
163-173, 177-188, 192-202, 204-220, 222-236, 240-253, 279-319, 326-331, 337-383, 390-399, 406-412, 
420-427, 431-438 and 381-395 of Seq ID No 359; 13-18, 28-34, 37-43, 50-59, 75-81, 83-97, 105-121, 139- 
147, 200-206, 209-227, 231-247, 260-271, 318-327, 366-381, 388-394, 399-406 and 182-201 of Seq ID No 
360; 6-29, 37-43, 51-56, 70-77, 82-102, 110-119, 127-143, 178-190, 201-209, 216-243, 261-269, 281-292, 
305-313, 327-339, 341-354, 356-373, 391-397, 423-429, 438-445, 450-478 and 21-314 of Seq ID No 361; 
4-12, 15-21, 32-41, 59-76, 80-89, 96-104 and 90-103 of Seq ID No 362; 9-28, 30-41, 44-54, 69-74, 77-82, 

90- 97, 104-123, 125-135, 149-155, 164-173, 177-184, 217-226, 230-235, 238-244, 258-272, 282-297, 300- 
305, 309-315, 317-322, 327-336, 348-362, 368-374, 380-387, 400-411, 414-424, 451-458, 460-466, 483-494, 
497-503, 506-511, 521-528, 540-553, 569-587, 598-606, 628-642, 661-681, 688-700, 718-733, 740-749, 
752-764, 769-783, 823-834, 848-854, 862-872, 878-884, 886-898, 915-920, 938-951, 954-961, 963-972, 
982-989, 996-1003, 1010-1016, 1021-1032, 1038-1044, 1047-1057, 1060-1070, 1079-1088, 1094-1102, 
1117-1127, 1129-1135, 1142-1153, 1158-1204, 1212-1229, 1234-1263, 1269-1277, 1308-1313, 1327-1338, 
1344-1376, 1400-1415, 1436-1443, 1448-1458, 1497-1504, 1511-1522, 1544-1566, 3-82 and 509-576 of 
Seq ID No 363; 8-36, 40-64, 71-79, 88-94, 102-109, 118-127, 138-148, 151-159, 163-174, 192-198, 200- 
206, 220-233, 268-273, 290-301, 304-309, 316-323, 331-349, 378-391, 414-420, 427-437, 455-475, 494-510, 
541-547, 549-555, 616-640, 1-60, 55-139, 212-308, 386-458 and 458-624 of Seq ID No 364; 16-31, 35-42, 
70-77, 91-101, 120-130, 132-140, 143-153, 185-190, 195-202, 215-222, 228-238, 241-251, 257-264, 268- 
277, 288-302, 312-324, 326-333, 341-348, 364-382, 415-429, 438-454, 458-466, 491-499, 501-521 and 273- 
281 of Seq ID No 365; 8-14, 32-57, 74-149, 155-177, 179-212, 221-266, 271-296, 304-324, 329-346, 349- 
359, 368-401, 413-419, 426-454, 465-478, 493-510 and 466-490 of Seq ID No 366; 22-28, 33-51, 64-89, 
96-119, 126-132, 138-146, 152-159, 161-169, 172-179, 193-198, 205-211, 221-231, 235-254, 273-280, 297- 
303, 312-320, 328-346, 351-373, 378-384, 391-398, 448-454, 460-468, 470-481, 516-558, 574-593, 597-602, 
613-623, 626-646, 649-656, 668-673, 675-683, 696-708, 715-722, 724-739, 745-751, 759-777, 780-804, 
816-822 and 102-113 of Seq ID No 367; 12-28, 41-91, 98-107, 112-120, 125-131, 151-193, 215-221, 240- 
250, 263-280 and 128-138 of Seq ID No 368; 16-24, 32-38, 46-62, 68-81, 90-105, 127-133, 144-150, 160- 
166, 178-184, 186-202, 210-219, 232-240, 252-258, 264-273, 293-324, 337-344, 349-357, 360-369, 385-398, 
410-416, 419-427, 441-449, 458-476, 508-515, 523-539, 544-549, 562-569, 571-579, 96-109 and 127-139 
of Seq ID No 369; 19-25, 28-34, 56-61, 85-97, 110-116 and 39-53 of Seq ID No 370; 4-37, 41-50, 62-72, 

91- 97, 99-109, 114-125, 136-141, 149-158, 160-166, 201-215 and 27-225 of Seq ID No 371; 15-31, 44-51, 
96-105, 122-130, 149-157, 162-168, 178-183, 185-192, 198-204, 206-213, 221-234, 239-245, 248-255, 257- 
266, 289-335, 349-357, 415-422, 425-441, 448-454, 462-468 and 463-481 of Seq ID No 372; 5-31, 39-55, 
63-72, 76-99, 106-155, 160-177, 179-199, 207-217, 223-240, 245-255, 261-267, 294-316, 321-343, 354-378, 
382-452, 477-488, 529-536, 555-569, 584-591, 593-612, 620-627, 632-640, 647-654, 671-680, 698-704, 
723-730, 732-750, 769-775, 781-788, 822-852 and 505-525 of Seq ID No 373; 3-18 of Seq ID No 374; 4- 
14 and 12-24 of Seq ID No 375; 4-11, 22-30 and 12-25 of Seq ID No 376; 5-12 and 4-18 of Seq ID No 
377; 4-28 and 7-14 of Seq ID No 378; 6-16 and 8-16 of Seq ID No 379; 4-15, 18-33 and 24-36 of Seq 
ID No 380; 4-10, 16-21 and 20-31 of Seq ID No 381; 6-19 of Seq ID No 382; 11-18 and 3-10 of Seq 
ID No 383; 13-24 and 3-15 of Seq ID No 384; 15-27 and 7-16 of Seq ID No 385; 11-16 and 1-15 of 
Seq ID No 386; 4-16 and 9-21 of Seq ID No 387; 4-24, 40-48, 54-67 and 22-39 of Seq ID No 388; 6- 
30, 34-55, 62-68, 78-106 and 68-74 of Seq ID No 389; 3-14 of Seq ID No 390; 9-19 and 6-21 of Seq ID 
No 391; 4-17 and 1-9 of Seq ID No 392; 5-30 and 1-8 of Seq ID No 393; 4-16, 23-46, 51-56 and 45-55 
of Seq ID No 394; 7-16 of Seq ID No 395; 2-14 of Seq ID No 396; 4-36, 43-65 and 50-62 of Seq ID 
No 397; 10-30 and 14-21 of Seq ID No 398; 9-17 and 1-10 of Seq ID No 399; 4-12 and 3-16 of Seq ID 
No 400; 4-15 and 5-23 of Seq ID No 401; 10-21 of Seq ID No 402; 6-16 of Seq ID No 403; 4-29, 31-38 
and 2-14 of Seq ID No 404; 4-35 and 33-42 of Seq ID No 405; 2-17 of Seq ID No 406; 9-18, 30-35 
and 15-33 of Seq ID No 407; 4-9 and 6-12 of Seq ID No 408; 3-17 of Seq ID No 409; 12-21, 37-44, 52- 
61, 72-80 and 38-48 of Seq ID No 410; 4-10, 29-44, 54-61, 69-78 and 13-27 of Seq ID No 411; 13-23, 
36-53 and 2-15 of Seq ID No 412; 4-25, 28-46, 56-72, 81-99, 120-132, 134-142, 154-160 and 129-141 of 
Seq ID No 413; 4-15, 24-33, 35-41, 64-86 and 21-33 of Seq ID No 414; 9-15 and 4-13 of Seq ID No 
415; 4-11, 13-19, 34-48 and 15-32 of Seq ID No 416; 4-21 and 11-31 of Seq ID No 417; 23-57 and 38- 
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50 of Seq ID No 418; 4-32 and 3-13 of Seq ID No 419; 4-10, 13-25, 32-42, 56-68, 72-84 and 26-38 of 
Seq ID No 420; 4-20, 31-48, 52-58, 65-71, 80-93, 99-108, 114-123 and 37-49 of Seq ID No 421; 6-12, 
14-20 and 3-25 of Seq ID No 422; 14-25, 27-38 and 5-14 of Seq ID No 423; 4-41, 57-105, 109-118, 123- 
136, 144-152 and 86-99 of Seq ID No 424; 6-19 of Seq ID No 425; 2-19 of Seq ID No 426; 14-47 and 
1-14 of Seq ID No 427; 4-21, 29-44 and 2-18 of Seq ID No 428; 23-29 and 10-28 of Seq ID No 429; 6- 
16, 22-36 and 11-22 of Seq ID Mo 430; 4-19, 30-44 and 18-27 of Seq ID No 431; 5-15, 37-45, 58-65 and 
38-47 of Seq ID No 432; 4-15, 23-34 and 4-15 of Seq ID No 433; 30-36, 44-54, 79-85, 101-114, 138-152, 
154-164, 170-175, 179-200, 213-220, 223-240, 243-255, 258-264, 268-284 and 10-28 of Seq ID No 434; 
the peptides comprising amino acid sequences of column "Identical region" of the Table IB, 
especially peptides comprising amino acid 210-226 and 738-753 of Seq ID No 449; 326-344, 326- 
348, 338-354, 371-392, 801-809 and 877-901 of Seq ID No 450; 893-906 of Seq ID No 451; 51-69 of 
Seq ID No 452; 110-125 of Seq ID No 453; 291-305 of Seq ID No 454; 210-226 and 738-753 of Seq 
ID No 455; 326-344, 326-348, 338-354, 371-392, 801-809 and 877-901 of Seq ID No 456; 893-906 of 
Seq ID No 457; 51-69 of Seq ID No 458; 110-125 of Seq ID No 459; 291-305 of Seq ID No 460; 32-44 
of Seq ID No 461; 399-410 of Seq ID No 462; the serum reactive epitopes as specified in the 
column of "aa from" to "aa to" of Table 2, especially peptides comprising amino acid 120-143, 
138-161 and 156-179 of Seq ID No 218; 110-129 and 168-184 of Seq ID No 219; 74-90 of Seq ID No 
222; 759-773 of Seq ID No 223; 237-260 of Seq ID No 224; 265-284 of Seq ID No 225; 65-74 of Seq 
ID No 226; 41-50 of Seq ID No 227; 163-174 of Seq ID No 229; 26-37 of Seq ID No 230; 174-189 of 
Seq ID No 232; 240-256 of Seq ID No 234; 285-297 of Seq ID No 236; 238-247 of Seq ID No 238; 
491-519 of Seq ID No 239; 114-140 of Seq ID No 243; 267-284 of Seq ID No 250; 439-453 of Seq ID 
No 252; 162-178 of Seq ID No 253; 347-364 of Seq ID No 254; 699-715 of Seq ID No 255; 60-71 of 
Seq ID No 256; 244-257 of Seq ID No 257; 44-63 and 57-76 of Seq ID No 258; 185-196 of Seq ID No 
260; 119-129 of Seq ID No 263; 182-195 of Seq ID No 266; 32-44 and 424-442 of Seq ID No 267; 247- 
256 of Seq ID No 268; 678-694, 785-805, 55-77 and 72-94 of Seq ID No 269; 210-226 of Seq ID No 
281; 37-59 of Seq ID No 289; 13-29 of Seq ID No 296; 136-159 of Seq ID No 348; 205-222 of Seq ID 
No 349; 99-110 of Seq ID No 350; 160-176 of Seq ID No 351; 457-470 of Seq ID No 355; 221-237 of 
Seq ID No 356; 167-190 of Seq ID No 357; 96-120 of Seq ID No 361; 399-417, 503-519 and 544-563 
of Seq ID No 364; 46-68, 159-183 and 184-198 of Seq ID No 371; 463-481 of Seq ID No 372; the 
immunogenic epitopes as specified in the column of "aa from" to "aa to" of Table 4; especially 
peptides comprising amino acid 110-129 and 168-184 of Seq ID No 219; 877-901, 333-354, 326-344 
and 801-809 of Seq ID No 277; 1-54 of Seq ID No 347; 544-563, 31-51, 107-119, 399-417 and 503-519 
of Seq ID No 364; 120-198 of Seq ID No 218; 20-35 of Seq ID No 219; 118-201 of Seq ID No 221; 48- 
132 of Seq ID No 242; 118-136 of Seq ID No 249; 162-178 of Seq ID No 253; 347-364 of Seq ID No 
254; 699-715 of Seq ID No 255; 50-76 of Seq ID No 258; 785-819 and 44-128 of Seq ID No 269; 90- 
128 of Seq ID No 274; 314-384 of Seq ID No 289; 327-349 of Seq ID No 293; 242-314, 405-478 and 
23-100 of Seq ID No 304; 129-210 of Seq ID No 305; 162-188 of Seq ID No 307; 750-772 of Seq ID 
No 310; 1-56 of Seq ID No 335; 322-337 of Seq ID No 337; 72-90 of Seq ID No 339; 374-395 of Seq 
ID No 345; 136-159 of Seq ID No 348; 141-164 of Seq ID No 358; 96-157 of Seq ID No 361; 1-82 of 
Seq ID No 363; 489-556 of Seq ID No 364; 159-183 and 49-133 of Seq ID No 371; The peptides 
comprising amino acid sequences of column "predicted immunogenic aa" and "location of 
identified immunogenic region (aa)" of Table 5, especially peptides comprising amino acid 4-26, 
35-41, 53-61, 73-84, 103-108, 114-120, 140-146, 156-162, 192-208, 214-219, 227-233, 239-252, 260-268, 
284-297, 1-48 and 113-133 of Seq ID No 475; 4-27, 38-44, 50-56, 59-64, 72-79, 83-89, 92-97, 108-116, 
123-148, 152-167, 183-196, 200-220, 232-244, 255-261, 265-274, 282-302, 309-317, 1-79 and 231-302 of 
Seq ID No 476; 6-28, 66-72, 85-105, 115-121, 144-151, 160-170, 176-185, 223-230, 252-288, 296-310, 
319-333, 367-374, 458-464, 471-480, 483-488, 520-528, 530-549, 559-564, 593-601, 606-616, 636-643, 
655-662, 676-682, 684-699, 719-726, 735-750, 757-764:, 777-785, 799-810, 812-843, 846-853, 868-873, 
880-889, 891-899, 909-929, 934-940, 963-969, 998-1004, 1007-1014, 1016-1022, 1030-1046, 1-80 and 808- 
821 of Seq ID No 477; 7-24, 35-41, 75-81, 91-114, 122-132, 137-144, 148-156, 183-192, 194-200, 212- 
228, 233-238, 251-258, 275-295, 326-332, 337-346, 1-79 and 305-321 of Seq ID No 478; 31-38, 42-52, 
66-72, 86-92, 98-104, 115-122, 127-146, 154-164, 169-187, 198-212, 225-237, 255-269, 13-92 and 135-142 
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of Seq ID No 479; 4-36, 39-49, 63-69, 71-77, 81-88, 123-131, 133-139, 160-169, 174-180, 188-194, 210- 
217, 273-278, 289-300, 317-334, 336-341, 383-401, 425-438, 1-68, 212-270 and 402-446 of Seq ID No 
480; 21-29, 31-42, 49-63, 72-79, 81-93, 112-132, 159-165, 188-195, 197-232, 262-267, 279-286, 294-301, 
318-326, 348-366, 381-405, 409-426, 436-465, 471-480, 484-492, 497-505, 521-544, 554-561, 567-577, 
581-589, 601-609, 611-622, 636-651, 653-667, 669-685, 700-708, 716-722, 729-744, 749-766, 780-786, 
789-811, 814-864, 1-57 and 84-106 of Seq ID No 481; 6-24, 35-48, 57-63, 72-78, 87-92, 113-119, 123- 
137, 147-153, 173-181, 212-233 and 1-124 of Seq ID No 482; 13-34, 62-69, 78-83, 86-91, 98-104, 107- 
115, 146-159, 179-188, 195-205, 209-221, 226-233, 239-253, 276-282, 284-294, 297-308, 331-354, 375-382, 
388-399, 421-433, 449-458, 464-469, 472-491, 508-513, 525-531, 534-550, 575-593, 601-618, 629-635, 
654-661, 666-680, 706-721, 723-740, 771-805, 810-830, 845-851 and 1-84 of Seq ID No 483; 4-32, 45-64, 
73-83, 86-92, 100-111, 125-147, 157-163, 170-175, 177-188, 226-232, 245-252, 258-274, 320-335, 348-359 
and 1-71 of Seq ID No 484; 13-40, 43-71, 76-83, 87-101, 109-119, 125-156, 162-175, 182-219, 226-232, 
240-262, 270-287, 306-318, 326-342, 344-408, 414-444, 449-456 and 1-51 of Seq ID No 485; 4-16, 18-34, 
45-54, 99-108, 134-140, 203-212, 241-257, 266-274, 279-291, 308-315, 330-336, 355-370, 374-382, 402- 
410, 428-455, 466-472, 474-480, 531-554, 560-566, 572-580, 597-618, 632-660, 664-674, 676-685, 691-705, 
708-735, 750-768, 1-87 and 342-480 of Seq ID No 486; The serum reactive epitopes as specified in 
the column of "aa from" to "aa to" of Table 6, especially peptides comprising amino acid 115-132 
and 1-26 of Seq ID No 475; 33-55 of Seq ID No 476; 1-25 of Seq ID No 478; 37-61 of Seq ID No 479; 
1-24 of Seq ID No 480; 1-23 of Seq ID No 481; 46-60 of Seq ID No 482; 1-28, 23-50 and 45-71 of Seq 
ID No 483; 1-22 and 17-38 of Seq ID No 484; 1-22 and 17-38 of Seq ID No 485; 1-27, 22-47 and 422- 
447 of Seq ID No 486; The immunogenic epitopes as specified in the column of "aa from" to "aa 
to" of Table 7, especially peptides comprising amino acid 115-132 and 1-47 of Seq ID No 475; 1- 
55 of Seq ID No 476; 22-85 of Seq ID No 477; 307-320 and 1-44 of Seq ID No 478; 15-76 and 40-92 
of Seq ID No 479; 1-59, 213-269 and 403-445 of Seq ID No 480; 1-56 and 85-105 of Seq ID No 481; 
37-121 of Seq ID No 482; 1-71 of Seq ID No 483; 1-38 of Seq ID No 484; 1-38 of Seq ID No 485; 1- 
47 of Seq ID No 486. 

15. A process for producing a S. agalactiae hyperimmune serum reactive antigen or a fragment thereof 
according to any one of the claims 11 to 14 comprising expressing the nucleic acid molecule 
according to any one of claims 1 to 7. 

16. A process for producing a cell, which expresses a S. agalactiae hyperimmune serum reactive antigen 
or a fragment thereof according to any one of the claims 11 to 14 comprising transforming or 
transfecting a suitable host cell with the vector according to claim 8 or claim 9. 

17. A pharmaceutical composition, especially a vaccine, comprising a hyperimmune serum-reactive 
antigen or a fragment thereof, as defined in any one of claims 11 to 14 or a nucleic acid molecule 
according to any one of claims 1 to 7. 

18. A pharmaceutical composition, especially a vaccine, according to claim 17, characterized in that it 
further comprises an immunostimulatory substance, preferably selected from the group 
comprising polycationic polymers, especially polycationic peptides, immunostimulatory 
deoxynucleotides (ODNs), peptides containing at least two LysLeuLys motifs, neuroactive 
compounds, especially human growth hormone, alumn, Freund's complete or incomplete 
adjuvants or combinations thereof. 

19. Use of a nucleic acid molecule according to any one of claims 1 to 7 or a hyperimmune serum- 
reactive antigen or fragment thereof according to any one of claims 11 to 14 for the manufacture of 
a pharmaceutical preparation, especially for the manufacture of a vaccine against S. agalactiae 
infection. 

20 . An antibody, or at least an effective part thereof, which binds at least to a selective part of the 



WO 2004/099242 



PCT/EP2004/004856 



- 102- 

hyperimmune serum-reactive antigen or a fragment thereof according to any one of claims 11 to 
14. 

21. An antibody according to claim 20, wherein the antibody is a monoclonal antibody. 

22. An antibody according to claim 20 or 21, wherein said effective part comprises Fab fragments. 

23. An antibody according to any one of claims 20 to 22, wherein the antibody is a chimeric antibody. 

24. An antibody according to any one of claims 20 to 23, wherein the antibody is a humanized 
antibody. 

25. A hybridoma cell line, which produces an antibody according to any one of claims 20 to 24. 

26. A method for producing an antibody according to claim 20, characterized by the following steps: 

• initiating an immune response in a non-human animal by administrating an hyperimmune 
serum-reactive antigen or a fragment thereof, as defined in any one of the claims 11 to 14, to 
said animal, 

• removing an antibody containing body fluid from said animal, and 

• producing the antibody by subjecting said antibody containing body fluid to further 
purification steps. 

27. Method for producing an antibody according to claim 21, characterized by the following steps: 

• initiating an immune response in a non-human animal by administrating an hyperimmune 
serum-reactive antigen or a fragment thereof, as defined in any one of the claims 12 to 15, to 
said animal, 

• removing the spleen or spleen cells from said animal, 

• producing hybridoma cells of said spleen or spleen cells, 

• selecting and cloning hybridoma cells specific for said hyperimmune serum-reactive antigens or 
a fragment thereof, 

• producing the antibody by cultivation of said cloned hybridoma cells and optionally further 
purification steps. 

28. Use of the antibodies according to any one of claims 20 to 24 for the preparation of a medicament 
for treating or preventing S. agalactiae infections. 

29. An antagonist, which binds to the hyperimmune serum-reactive antigen or a fragment thereof 
according to any one of claims 11 to 14. 

30. A method for identifying an antagonist capable of binding to the hyperimmune serum-reactive 
antigen or fragment thereof according to any one of claims 11 to 14 comprising: 

a) contacting an isolated or immobilized hyperimmune serum-reactive antigen or a fragment 
thereof according to any one of claims 11 to 14 with a candidate antagonist under conditions to 
permit binding of said candidate antagonist to said hyperimmune serum-reactive antigen or 
fragment, in the presence of a component capable of providing a detectable signal in response to 
the binding of the candidate antagonist to said hyperimmune serum reactive antigen or fragment 
thereof; and 

b) detecting the presence or absence of a signal generated in response to the binding of the 
antagonist to the hyperimmune serum reactive antigen or the fragment thereof. 

31. A method for identifying an antagonist capable of reducing or inhibiting the interaction activity of 
a hyperimmune serum-reactive antigen or a fragment thereof according to any one of claims 11 to 
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14 to its interaction partner comprising: 

a) providing a hyperimmune serum reactive antigen or a hyperimmune 
fragment thereof according to any one of claims 11-14, 

b) providing an interaction partner to said hyperimmune serum reactive antigen or a fragment 
thereof, especially an antibody according to any one of the claims 20 to 24, 

c) allowing interaction of said hyperimmune serum reactive antigen or fragment thereof to said 
interaction partner to form a interaction complex, 

d) providing a candidate antagonist, 

e) allowing a competition reaction to occur between the candidate antagonist and the interaction 
complex , 

f) determining whether the candidate antagonist inhibits or reduces the interaction activities of the 
hyperimmune serum reactive antigen or the fragment thereof with the interaction partner. 

32. Use of any of the hyperimmune serum reactive antigen or fragment thereof according to any one of 
claims 11 to 14 for the isolation and/or purification and/or identification of an interaction partner of 
said hyperimmune serum reactive antigen or fragment thereof. 

33. A process for in vitro diagnosing a disease related to expression of the hyperimmune serum- 
reactive antigen or a fragment thereof according to any one of claims 11 to 14 comprising 
determining the presence of a nucleic acid sequence encoding said hyperimmune serum reactive 
antigen and fragment according to any one of claims 1 to 7 or the presence of the hyperimmune 
serum reactive antigen or fragment thereof according to any one of claims 11-14. 

34. A process for in vitro diagnosis of a bacterial infection, especially a S. agalactiae infection, 
comprising analysing for the presence of a nucleic acid sequence encoding said hyperimmune 
serum reactive antigen and fragment according to any one of claims 1 to 7 or the presence of the 
hyperimmune serum reactive antigen or fragment thereof according to any one of claims 11 to 14. 

35. Use of the hyperimmune serum reactive antigen or fragment thereof according to any one of 
claims 11 to 14 for the generation of a peptide binding to said hyperimmune serum reactive 
antigen or fragment thereof, wherein the peptide is selected from the group comprising anticalines. 

36. Use of the hyperimmune serum-reactive antigen or fragment thereof according to any one of 
claims 11 to 14 for the manufacture of a functional nucleic acid, wherein the functional nucleic acid 
is selected from the group comprising aptamers and spiegelmers. 

37. Use of a nucleic acid molecule according to any one of claims 11 to 14 for the manufacture of a 
functional ribonucleic acid, wherein the functional ribonucleic acid is selected from the group 
comprising ribozymes, antisense nucleic acids and siRNA. 
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SEQUENCE LISTING 

SeqID 1 

atgaagaaag ttttaacctt cttattatgt agcttatatt ttgtatcaat cccagctata 6 0 

agtaccgagg agccattaac gctttctcaa aatagacgat acgccfctaac tcagacagtt 12 0 

gtggataagg aaatgtafctt tgatgctatt ccagagagac caactacaaa aattgaaata 18 0 

agcagttttc aagatgaagc tttaaccata acaggagaga ctttagttcc aaatacttta 240 

ttgtcgattg tttctcttac aattaattca aatggaattc cagtatttac tttgtccaat 300 

ggacagttca taaaggctag tcgtgaagct atttttaatg atttagtttc taagcagcaa 360 

tctgtctoat tggattattg gttaaaacct tcttttgtga cttatgaagc accttacaca 42 0 

aatggtgtta gtgaagttaa aaataattta aaaccttatt ccagagtgca tttagtagaa 48 0 

caagcagaaa cagaacatgg tatttattat aaaacggaca gtggtttttg gatatcagta 54 0 

gaagatctat cggttgcfcga taatcgtatg gctaaggttc aagaagtatt attggaaaaa 600 

tataataaag ataaatatgg tatatacatc aaacaattaa atacacagac agtagctggt 660 

attaatattg ataggtcaat gtactctgca agtattgcaa aattagcaac actttatgct 72 0 

agccaagagc aagtaaaatt aggaaaactg tcattggaca gcaaatttga atacaaagat 78 0 

aatgttaatc aatttcctaa tagctatgat ccaagtggta gtgggaaatt agagaaaaaa 840 

gctgatcata aattatatac cgttaaggag ttacttgaag caactgccaa ggaatcagat 90 0 

aatgttgcaa caaatatgct aggttattat gtcaataatc aatatgattc aatgtttcaa 960 

acacaagtag acacaatttc tggtatgcac tgggatatga aaaaacgcca gatttctcca 102 0 

caagctgcgg gtaaaatgat ggaagcaatt tattatcaaa afcggagatat tgttaattac 108 0 

ctatcaaaga ctgattttga taatacaaga atccctaaaa atataccagt taaagtagca 114 0 

cataaaattg gagatgctta tgactacaaa catgatgcgg cfcattgttta tgcagaacaa 12 00 

ccttttatta tgattatctt tacggataaa tcatcttacg atgatatcac taagattgct 12 6 0 

gacgacgttt accaggtttt aaaataa 128 7 

SeqID 2 

atgaaaaaaa gaatattatc agcagtcctt gtgagtggag taactcttgg aactgctgcc 60 

gtaacagtaa acgctgatga ctttgactcg aaaattgctg cfcactgattc agtaatcaat 120 

actttatcag ggcaacaagc agcagcacaa aaccaggtga ctgctattaa aggacaagtt 18 0 

ggtgctttag agtctcaaca atcagaatta gaagcgcaaa atgctcaact tgaggctgta 240 

tctcagcagt taggtcaaga gattcaaact ctttctaata aaattgttgc acgtaacgaa 3 00 

tctttgaaaa agcaagttcg tagtgctcaa aagggtaact tgactaacta tattaatact 3 60 

attfctaaatt cgaaatcagt atctgatgct gttaatcgtg ttgttgctat tcgtgaggtt 420 

gtttcagcta atgaaaagat gttagcacaa caagaggcfcg acaaagcggc cttagaagct 48 0 

aaacaaattg aaaatcaaaa tgctataaat actgtagcag ctaacaagca agctatcgaa 540 

aataataaag cagccttagc aacacaaaga gcacaattag aagcagctca attagagtta 60 0 

tctgctcaat tgactacagt ccaaaatgaa aaagcatctt taattcaggc aaaagcacaa 660 

gctgaggaag ctgcgcgtaa ggcagctgaa gcacaagcag cagctgaagc gaaagcacaa 720 

gctgaagcga aagcacaagc tgagtctgta gcaaaagcac aggcagcagc acaagttgag 78 0 

tctgccacag ctcctactga aacagttcaa acacaaccga gaactgaaat aaagccttct 84 0 

aatcttacgg ctacttcttc ggcaactact gttgctacaa ctacagctac agctactaat 900 

gagccaaaag ttactcaacc ttctgtagta actaaagcag ttgaagcacc taaagcagtt 960 

gtttcttcaa cacctagagc ggtatcaaaa cctgttgtca gaagtfcatga ttcaagtaac 1020 

acatacccaa tgggacaatg tacttgggga gctaagtcaa tggcttcttg ggttggtaac 108 0 

tattggggaa atgctaatca gtggggtgct agtgcacgtg cagctggtta ttctgtagga 1140 

actacccctc gtgttggtgc agtagctgtt tggccatatg atggtggtgg ttacgggcac 1200 

gttgcagtag taacgtcagt ggctaacaat tcatctatcc aagttatgga atcaaactat 1260 

gctggaaata tgagtattgg caafctaccgt ggttcattta accctagtgc ttctggctct 132 0 

gtttactata tttacccaaa ttaa 1344 

SeqID 3 

atgaataaac gtatttttgt tgagaaaaag gctgactttg acattaaatc ggctagtctt 60 

gtgaaagagt tgacgcataa tctacaactg acttctttga aggatttgcg tattgtgcag 12 0 

gtctatgatg tcttcaattt ggctgaggat ttgctggcgc gtgctgagaa gcatattttc 180 

tctgagcagg tgacagactg tcttttgacg gaaactgaaa tcactgcgga gcttgataag 240 

gttgccttct ttgccattga ggcgcttcct ggtcaatttg accaacgtgc tgctagttcg 3 00 

caagaatctt tgcfcattgct tggaagtgac agtcaggtta aggtcaatac agcccagctt 360 

tacttggtca ataaggatat tgcagaagca gagcttgaag ccgttaagaa ctatcttttg 42 0 

aatcctgttg attcacgttt caaggacatt accttgccgc ttgaagagca ggctttctct 48 0 

gtgtctgata agacaatccc tagtcttgat ttctttgaaa cttataaggc tgacgatttt 54 0 

gcggcctata aggcagagca gggcttggct atggaggtcg atgatcttct cttcatccaa 600 

gattatttca aatcaatcgg gcgtgtgcca actgagactg agttgaaagt tttggatact 66 0 

tactggtcag accactgccg tcacacaacc tttgaaactg agttgaaaaa catcgacttt 72 0 

tcagcttcta aattccaaaa acaattgcag gcgacttatg acaaatatat cgccatgcgt 78 0 
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gatgaacttg gtcgttctga aaagccgcag acacttatgg atatggcgac tatctttggt 84 0 

cgttatgaac gtgccaatgg acgactggat gatatggaag tctcagatga aatcaatgcc 90 0 

tgctcagttg aaattgaagt agatgttgat ggtgtaaaag agccttggct cctcatgttt 960 

aagaatgaga ctcacaatca cccaacagaa attgagccat ttggtggcgc agcaacttgt 102 0 

atcggtggtg ccattcgtga cccgttgtca ggacgttcat acgtttatca ggctatgcgt 108 0 

atttcaggcg caggcgatat tacgactccg attgcggaaa cacgtgctgg taaattgccg 114 0 

caacaggtta tttctaaaac tgcggcgcac ggctattctt catatggtaa ccagattggg 120 0 

cttgcgacaa cttatgtgcg tgagtacttt caccctggct ttgtagccaa acgcatggag 126 0 

cttggagctg tggttggtgc tgcacctaag gaaaatgtgg ttcgtgaaaa accagaagca 132 0 

ggcgatgtgg tcatcttgct cggtggtaaa acaggtcgtg acggtgtcgg cggtgcgaca 13 8 0 

ggttcatcta aggttcaaac ggttgaatct gtggaaacag ctggcgcaga ggtacaaaaa 1440 

gggaatgcca tcgaagaacg taagattcaa cgccttttcc gtaatggtaa tgtcactcgc 15 00 

ctaatcaaga aatcaaatga cttcggtgca ggtggtgtct gtgttgccat cggtgaattg 15 60 

gctgacggtc ttgaaatcga tttggacaag gtaccgctta aataccaagg tcttaatggt 162 0 

actgaaattg caatctcaga atcacaagag cgtatgtcag tcgtfcgttcg tcctagtgac 1680 

gtggatacct tcatcgcagc ctgcaacaag gaaaatatcg atgccgttgt ggttgctact 174 0 

attactgcaa aaccaaacct tgtcatgact tgggatggtg aaacaattgt tgacttggaa 18 0 0 

cgccgtttcc ttgataccaa cggtgtccgt gtcgttgttg atgctaaagt cgttgacaaa 18 6 0 

gacttgacag ttccagaggt acgcacaaca tcagcagaga cacttgaagc ggatacgctt 192 0 

aaggtcttgt ctgacctcaa ccacgctagt caaaaaggtc tccaaactat ctttgactca 198 0 

tctgttggtc gctcaaccgt taaccaccca atcggtggcc gttaccaaat cacaccaaca 2 04 0 

gaaagctctg ttcaaaaatt gccagttcaa cacggtgtca caacaactgc atctgttatg 210 0 

gctcaaggtt acaatcctta tattgcagaa tggtcacctt accacggtgc tgcctatgct 216 0 

gtcattgaag cgacagctcg cttggtagca acgggtgctg actggtctcg tgcacgtttc 222 0 

tcttaccaag agtatttcga gcgtatggat aaacaggcag agcgttttgg tcagccagta 228 0 

tcagctctto ttggttctat tgaggctcag attcaacttg gattgccatc aatcggcggt 234 0 

aaggactcta tgtctggtac tttcgaagaa ttgacagtac cgccaacctfc ggtagctttc 2400 

ggtgtgacaa cagcggacag ccgcaaggtt ctctctcctg agtttaaagc ggctggtgag 24 6 0 

aatatctact acattccagg tcaagctatt tcagaagata ttgattttga ccttatcaag 252 0 

gctaacttta gccagttcga gactattcaa gctcaacata agattactgc tgcttcagct 25 8 0 

gttaaatacg gtggtgtcct agaaagtctt gctctcatga cttttggtaa ccgtafccggg 264 0 

gcctctgttg aaattgcaga gcttgacagt agcttgacag ctcaactcgg aggctttgtc 2700 

tfctacatcag ctgaggaaat tgctgactcg gtgaaaatcg gtcaaactca ggcagccttt 2760 

acacttactg tcaatggaaa tgaccttgct ggggcaagcc ttctatcagt cttcgaaggc 282 0 

aaattggaag aggtttaccc aacagagttt gaacaggcag acgctcttga agaagttcct 28 8 0 

gctgtggtat cagatactgt tatcaaggct aaggaaacaa ttgaaaaacc agtggtttac 294 0 

attccagtct tccctggtac caactcagaa tatgattcag ctaaggcctt tgaacaggtt 3 000 

ggagctagtg tcaacttggt agcatttgta accttgaatg aggctgctat tgctgactca 3 0 60 

gttgacacaa tggtcgctaa tattgctaag gcaaatatca tcttctttgc tggaggattc 3120 

tcagcagcgg atgaaccaga tgggtctgct aagtttatcg tcaatatctt gcttaacaag 318 0 

aaggttcgcg cagctattga cagcttcatc gaaaaaggtg gccttatcat cggtatctgt 324 0 

aatggtttcc aagcccttgt taaatcaggt cttcttccat acggaaactt cgaggaagct 3300 

ggtgagacaa gtccaaccct cttctataac gatgccaacc agcacgttgc caagatggtt 33 6 0 

gagactcgta tcgcaaatac caactcacct tggttggtag gagttgaggt cggcgatatt 342 0 

catgccatcc cagtttcaca cggtgaaggt aaatfctgttg tcagcgcttc tgaatttgca 3480 

gagctaagag acaatggtca aatctggagc caatatgtgg actttgacgg acaaccatct 354 0 

atggattcta aatacaatcc aaatggctct gtcaatgcca tcgaagggat taccagcaag 3 600 

aatggtcaaa tcatcggtaa gatggggcac tcagaacgct gggaagacgg cctcttccaa 3660 

aatatccctg gtaacaaaga ccaagccttg tttgctagtg cagtaaaata ttttacagga 372 0 

aaataa 3 72 6 

SeqID 4 

atgaaaatga ataaaaaggt actattgaca tcgacaatgg cagcttcgct attatcagtc 60 

gcaagtgttc aagcacaaga aacagatacg acgtggacag cacgtactgt ttcagaggta 120 

aaggctgatt tggtaaagca agacaataaa tcatcatata ctgtgaaata tggtgafcaca 18 0 

ctaagcgtta tttcagaagc aatgtcaatt gatatgaatg tcttagcaaa aataaataac 24 0 

attgcagata tcaatcttat ttatcctgag acaacactga cagtaactta cgatcagaag 3 00 

agtcatactg ccacttcaat gaaaatagaa acaccagcaa caaatgctgc tggtcaaaca 360 

acagctactg tggatttgaa aaccaatcaa gtttctgttg cagaccaaaa agtttctctc 42 0 

aatacaattt cggaaggtat gacaccagaa gcagcaacaa cgattgtttc gccaatgaag 48 0 

acatattctt ctgcgccagc tttgaaatca aaagaagtat tagcacaaga gcaagctgtt 54 0 

agtcaagcag cagctaatga acaggtatca ccagctcctg tgaagtcgat tacttcagaa 600 

gttccagcag ctaaagagga agttaaacca actcagacgt cagtcagtca gtcaacaaca 660 

gtatcaccag cttctgttgc cgctgaaaca ccagctccag tagctaaagt agcaccggta 720 

agaactgtag cagcccctag agtggcaagt gttaaagtag tcactcctaa agtagaaact 780 
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ggtgcatcac cagagcatgt atcagctcca gcagttcctg tgactacgac ttcaccagct 840 

acagacagta agttacaagc gactgaagtt aagagcgttc cggtagcaca aaaagctcca 9 00 

acagcaacac cggtagcaca accagcttca acaacaaatg cagtagctgc acatcctgaa 9 60 

aatgcagggc tccaacctca tgttgcagct tataaagaaa aagtagcgtc aacttatgga 1020 

gttaatgaat tcagtacata ccgtgcggga gatccaggtg atcatggtaa aggtttagca 1080 

gfctgacttta ttgtaggtac taatcaagca cttggtaata aagttgcaca gtactctaca 1140 

caaaatatgg cagcaaataa catttcatat gttatctggc aacaaaagtt ttactcaaat 1200 

acaaacagta tttatggacc tgctaatact tggaatgcaa tgccagatcg tggtggcgtt 12 60 

actgccaacc actatgacca cgttcacgta tcatttaaca aataa 13 05 

SeqID 5 

atgttaaaac attttggtag taaagtaaga aatttgagag taactagaaa tattactcgc 60 

gaagattttt gtggtgatga aactgagtta tctgtaagac aattagctag gattgaaagt 120 

ggacagtcaa ttccaaactt gactaaagct cactacatag ctaaacaatt gaatgtaaaa 180 

ttggatattt taacaggagg agagagctta gaattaccta aacggtataa agagttgaag 240 

tacaaattgt tgcgtacccc gacttacgga gatgctaata gacttgcagt tagagaagca 3 00 

tattttgatg aaatctatga agtattttat gaggagctcc cagaagatga aagactaatt 3 60 

attgattgta tgcaatcaaa attagatgtt catttttctg ttaatgataa ttttggaatc 420 

actattttac atgactactt tgatcaaatt aagaaaaaaa aggaatafcac cacaaatgat 480 

tttgttatga tagatttata tttactttgc ttttctatta actatggaat gaagagcttg 540 

tattctttag agaactatca ttttattatg tcaaaattgt tagagcaaga taatttactt 600 

cctgaggata attttcaatt gaacaacgtt ttattaaatc atgtcgaatt agcttttcaa 660 

tttaagcaga aaaaatatgt acagcaaatt attcatagaa gtaatgctat tatgacagaa 72 0 

atccatgatt ttcaaaaaag accaatacta agcttaatag agtggaaata tttattaatt 780 

attgaaaaag atagaacaaa agcagaaaca tgttttaaac aatcaattct atttgcagag 840 

ttaataggtg atctatattt aaaaggaaaa ttaatagagg agtggaataa agatttaaca 900 

tga 9 03 

SeqID 6 

atgactgaaa aaactaaagc ggtagaaact acagatgttg ctcttgcaat tgatacattg 60 

gttcaaaacg gtttaaaagc acttgatgag atgcgtcaac ttaatcaaga acaagttgat 120 

tatattgtag ctaaagcttc tgttgcagcc ttggatgccc atggagaact tgctcttcat 18 0 

gcagtagaag aaactggtcg tggtgtcttt gaagataaag cgacaaaaaa tttatttgca 240 

tgtgaacacg tcgttaacaa catgcgtcat acaaagacag tgggagttat tgaagaagac 3 00 

gatgtcacag gcttgacatt aattgctgaa ccagtgggag ttgtctgtgg tattacgcca 3 60 

acaaccaacc cgacttcaac tgctattttc aagtcattaa tctctcttaa aacacgtaat 420 

ccaattattt ttgccttcca cccatcagca caagaatcat cagcacatgc agcacgtatt 480 

gttcgtgatg ctgctattgc tgctggagct cctgaaaatt gtgtccaatg gattgaacaa 540 

ccatcgattg atgcaactaa tgctttgatg aaccatgatg gtattgccac tatccttgcg 600 

acaggtggta atgcgatggt taaagctgcc tactcatgtg ggaaacctgc ccttggtgta 660 

ggtgctggga atgttccagc ttatgttgaa aaatcagcta atatccgaca agccgctcac 720 

gatattgtta tgfcctaaatc atttgataac ggtatggtat gtgcatcaga acaggctgtt 78 0 

attattgata aagaaattta taaagaattt gttgaagaat ttaaatctta tcatacttat 84 0 

ttcgttaata agaaagaaaa agctcttctt gaagaattct gttttggtgc aaaagcaaat 900 

agcaaaaact gtgctggtgc aaaattaaat ccaaatattg ttggaaaatc agcagtttgg 96 0 

attgctgaac aagctggatt tactgttcca gaaggaacaa atattcttgc cgcagaatgt 102 0 

actgaagtta gtgaaaaaga acctttgaca cgtgaaaaac tatcacctgt tattgccgta 108 0 

ttgaaagctg aatctacaga agatggtgtt gaaaaagctc gtcaaatggt tgagtttaat 114 0 

ggtctaggac actcagctgc tattcataca aaagatgcag acttagctcg tgaatttgga 12 0 0 

actagaattc gtgctatccg tgttatttgg aattctcctt ctacttttgg tggtattggg 1260 

gatgtttaca atgctttctt gccatcattg actcttggtt gtggttcgta tggacgtaac 13 2 0 

tcagttggtg ataacgttag tgctataaat ctcttgaafca tcaaaaaagfc aggaagacgt 13 8 0 

agaaataata tgcaatggtt taaagtccct tcaaaaacat acttcgaacg tgattctatt 144 0 

caatatttgc aaaaatgtcg cgatgttgaa cgtgtcatga ttgttacaga tcacgccatg 15 0 0 

gttgagcttg gtttcttgga tcgtatcatt gaacaacttg atcttcgtcg caataaagtt 1560 

gtttatcaga tttttgctga agtagaacca gatccagaca ttacaacagt aatgaaagga 162 0 

acagatttga tgcgtacctt caaaccagac actattattg cattaggtgg tggttctcct 1680 

atggatgctg cgaaagttat gtggctcttc tatgaacaac cagaagttga tttccatgac 174 0 

ctcgttcaaa aattcatgga tatccgtaaa cgtgcgttca aattcccaga acttggtaag 1800 

aaaacaaaat ttgttgctat tccaacaact tctggaacag gttctgaagt gacaccattt 1860 

gccgttatct ctgacaaagc aaataatcgt aaatatccaa tcgctgatta ttcattaaca 1920 

ccaactgtag ctatcgttga tccggccctt gttatgactg ttcctggttt tattgcggct 198 0 

gatacgggta tggatgtatt gacacatgca acagaggctt atgtttcaca gatggcaaat 204 0 

gattatacag acggacttgc ccttcaagca afctaaaattg tgtttgacta cttagaacgc 210 0 

tcagtaaaag atgctgattt tgaagcacgt gaaaagatgc acaatgcgtc aactatggca 2160 
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ggtatggctt tcgccaatgc attcttaggt atttctcact caatggcgca taaaattggc 222 0 

gcacaattcc ataccgtaca cggacgtaca aatgcaattc ttttgccgta tgttatccgc 22 8 0 

tataatggaa ctcgtccagc taaaacagca acatggccta agtacaatta ctatcgtgct 2340 

gatgaaaaat atcaagatat cgctaaatta ctcggcttac cagcagcaac accagaagaa 2400 

gctgtcgaat catatgcaaa agctgtgtat gaccttggaa cacgtttagg tattaagatg 2460 

aatttccgcg accaaggcat tgatgaaaaa gaatggaaag aaaaatctcg tgaactagct 2520 

ttcttagcat atgaagatca atgttcacca gcaaatccac gccttccgat ggtagatcac 2 580 

atgcaagaaa tcatcgaaga tgcttattat ggttatgagg aacgtccagg acgtcgtaag 2640 

taa 2643 

SeqID 7 

gtgggtatta aagtttataa accaacgaca aatggccgtc gtaatatgac ttctttggat 6 0 

tttgcagaaa tcactacaaa cactcctgag aaatcattgc ttgtttcact aaagaataaa 12 0 

gcaggacgta acaacaacgg acgtattact gttcgtcacc aaggtggtgg acacaaacgt 18 0 

cattaccgtt tgattgactt caaacgtaat aaagatggtg ttgaagcagt tgttaaaaca 24 0 

attgaatacg atccaaatcg tactgcaaat attgctcttg tacactatac tgatggtgtt 3 00 

aaagcttaca ttcttgcgcc taaaggcctt gaagtaggtc aacgcattat ttctggtcca 360 

gaagcagata tcaaagttgg taacgcactt ccacttgcta a.cattccagt cggtacagtt 42 0 

atccataata ttgaattgca accaggtaaa ggtgctgaat tgattcgtgc cgcgggtgct 48 0 

tctgctcaag tacttggtca agaaggtaaa tatgttcttg ttcgccttca atcaggtgaa 54 0 

gtacgtatga tccttggtac ttgccgtgca acaattggta ctgttggtaa tgaacaacaa 60 0 

tcacttgtta acattggtaa agcaggacgt aatcgttgga aaggtgttcg ccctacagtt 660 

cgtggttctg taatgaaccc taatgatcac ccacacggtg gtggtgaagg taaagcacca 72 0 

gttggtcgta aagcaccatc tactccatgg ggtaaacctg cacttggact taaaactcgt 78 0 

aataagaaag ctaaatcaga caaacttatt gttcgtcgtc gtaaccaaaa ataa 834 

SeqID 8 

atgattgagt ttgaaaaacc aataataaca aaaattgatg aaaataaaga ttacggcaga 6 0 

tttgtcatcg aaccactaga acgtggctac ggtacaactc taggaaactc tcttcgtcgt 120 

gtgcttctgt cttcacttcc aggtgcagca gtcacatcaa tcaaaattga tggagtatta 18 0 

cacgaatttg atactatccc aggtgtacgt gaagatgtga tgcaaatcat ccttaatgta 240 

aaaggacttg ctgtgaaatc ttacgtcgaa gacgaaaaga ttatcgaact agacgttgaa 300 

ggtcctgcag aaattactgc cggagatatt ttaacagata gtgatatcga aattgtaaac 3 60 

cctgaccatt atcttttcac aattgctgaa ggtcacagtt tgaaagcaac aatgacagtt 42 0 

gctaaaaacc gaggttatgt tccagcagaa ggaaataaaa aagatgatgc accagtggga 48 0 

acattggctg tagattcaat ctatacacca gtgaaaaaag ttaattatca agttgaacct 540 

gctcgtgtag gtagtaacga tggctttgat aaattaacta ttgaaatcat gacaaatggc 600 

acaatcattc ctgaggatgc tcttggtctt tcagctcgcg tcttaattga acacttaaac 66 0 

ttgtttactg atttaacaga ggttgctaaa gcaactgaag taatgaagga aacagagaaa 72 0 

gtgaacgatg agaaagtgct tgaccgcaca attgaggaac ttgatttgtc agtgcgctca 78 0 

tataactgtt taaaacgtgc tggaattaat acagtatttg atttaacaga aaaaactgag 84 0 

cctgaaatga tgaaagtccg aaacttagga cgtaagagtc ttgaagaagt taaaatcaaa 90 0 

cttgccgatc taggtcttgg attaaaaaac gataaataa 93 9 

SeqID 9 

atgaaaaaga ttcgattatc aaagtttatt aaaatgattg ttgttatttt gtttttaatt 6 0 

agtgtagcag ctagttttta ttttttccac gttgcccaag ttcgagatga taaatccttt 120 

atttcaaatg gtcaacgtaa gcctggaaac tctttatatg cttatgataa atcctttgat 180 

aagctattaa agcaaaaaat agaaatgaca aaccaaaata taaagcaagt tgcttggtat 240 

gttcctgctg ctaagaaaac tcataagaca gctgttgtcg ttcatggttt tgcgaatagc 3 00 

aaagagaata tgaaggcata tggttggctg tttcataagt taggatacaa tgttcttatg 36 0 

cc tgacaata ttgcacatgg tgaaagtcat gggcagttga taggctatgg ctggaacgac 42 0 

cgcgagaaca ttatcaaatg gacagaaatg atagttgata agaatccatc aagccaaatt 48 0 

actttatttg gtgtttcaat gggtggagca acagtcatga tggctagtgg tgaaaaatta 54 0 

cctagtcagg ttgttaatat cattgaagat tgcggttatt ctagtgtttg ggatgaatta 60 0 

aaatttcagg ctaaagagat gtatggttta ccagccttcc cactcttata tgaagtttca 660 

acaatttcta aaatcagagc aggtttttcg tatggacaag caagtagtgt cgaacaattg 72 0 

aaaaagaata atttaccagc cctctttatt catggtgata aggataattt tgttccaaca 78 0 

agtatggttt atgacaacta taaagctaca gcaggtaaga aagagcttta tattgtaaaa 84 0 

ggggcaaaac atgcgaaatc ttttgaaaca gagccagaaa aatatgagaa acgtatctct 900 

agttttttga aaaaatatga aaaataa 92 7 



SeqID 10 

atgtcaaaag tacgtggctt tgaattagtt agccaattta gcaataaaga gttattacca 
aaacgagaaa cagctcacgc agctggctat gatttaaaag tagctaaaaa aacggttatt 



60 
120 
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gaacctggtg aaattacatt ggtcccgacc ggtattaaag cacacatgca accaggggag 18 0 

gtcctgtacc tttatgatcg ttcatcgaat ccacgtaaaa aagggattgt cctaataaat 240 

tcagttggtg ttattgatgg tgactactat aataaccaag fccaatgaagg acatattttt 3 00 

gcacagatgc aaaatattac tgatcaggct gttattttag aagaagggga acggattgtt 3 60 

caggctgttt ttgccccatt cttattggca gatgacgatc aggctacagg aatgcgtaca 42 0 

ggtggttttg gtagtactgg aaaataa 44 7 

SeqID 11 

atgaaatttg gaaaaaaact tggctttcta gccttattga tgtctattgt cctcattcta 60 

ggtgcatgtg gtaaaactgg acttggaaat tctactggga attctacaaa aaatgtaacc 120 

aaaaaatctg ctaaaaactt gaaactaggt gtatctattt caacaactaa caatccctac 18 0 

tttgttgcga tgaaagacgg tattgacaaa tatgctagca ataaaaaaat aagcattaag 240 

gtagctgatg ctcaagatga cgccgcacgt caagcagatg atgtccaaaa ctttattagt 3 00 

caaaacgtag atgctatttt gattaatccc gttgactcaa aagctattgt aacagctatt 3 60 

aagtctgcaa acaacgctaa tattcctgtt attctaatgg accgtggtag tgagggaggc 42 0 

aaagtattaa ctactgtcgc ttcagataac gttgctgctg gtaagatggc tgcggattat 48 0 

gctgttaaaa aactaggcaa aaaagcaaaa gcctttgaat tatctggagt accaggtgcc 540 

tctgctactg tagaccgtgg taaaggattt cattcagttg caaaatctaa acttgacatt 600 

ctttcaagcc aatctgccaa ttttgaccgc gccaaagctt tgaatactac acaaaatatg 660 

attcaaggac ataaagatgt ccaaattatt ttcgcacaaa atgatgaaat ggcacttggg 72 0 

gctgcacagg ctgtcaaatc tgctggcttg caaaatgttc ttatcgtagg aattgatggt 780 

cagccagatg cccatgatgc cattaaaaaa ggagatattt cagccactat tgcacaacaa 840 

ccagctaaga tgggagaaat agcaattcaa gcagcaattg atcattataa aggcaaaaaa 9 00 

gtagagaaag aaacaattto tccaatttac cttgttacca aagacaatgt tgaaaaatac 960 

aattggtaa 969 

SeqID 12 

atgggaaaag aaaaattaat tttagcttat tctggtggtt tagatacctc tgttgctata 6 0 

gcttggttaa aaaaagatta tgacgttatt gctgtctgta tggatgttgg cgaaggtaaa 12 0 

gatcttgatt ttattcacga taaggcttta acaataggtg ccatagaatc ttatatttta 180 

gatgttaaag atgagttcgc tgagcatttt gtgttaccag ctttacaggc acatgcgatg 240 

tatgaacaaa aatacccgct tgtttcagcg cttagtcgtc caattattgc tcaaaaatta 3 00 

gttgaaatgg cacatcaaac aggtgcgacc acaattgccc atggatgtac aggtaaagga 3 60 

aatgaccaag tgcgttttga agttgcaatt gcagcactag atcctgaatt aaaagtaatt 420 

gcccctgttc gtgagtggaa gtggcatcgt gaagaggaaa ttacttttgc aaaagctaac 48 0 

ggcgtaccta ttccagcaga tttagataac ccttattcta fcagaccaaaa tttatgggga 540 

cgtgccaatg aatgtggtgt gcttgagaat ccttggaatc aagcgccaga agaagctttt 60 0 

ggtattacaa aatcaccaga agaagctcct gattgtgcag aatatattga tattacgttc 660 

caaaatggga agcctatagc tataaacaat caagaaatga cactagcaga tttgatttta 72 0 

tcgctaaatg aaattgcagg aaaacatggt attggacgca ttgafccatgt tgaaaatcgt 780 

ttagtcggta tcaaatcacg tgaaatttat gaatgtcctg cggcaatggt actgttagct 84 0 

gctcataaag aaattgaaga tttaacatta gtccgtgaag tgtctcattt taaacctatc 900 

cttgaaaatg aactatcaaa tcttatttat aatgctttat ggtttagtcc agctacaaaa 9 60 

gctatcattg cctatgtcaa agaaacacaa aaagtggtta atggtactac aaaagttaag 102 0 

ttatataaag gttctgccaa agtcgttgct cgccactcgt ctaattcatt atatgatgag 108 0 

aacttagcaa cttatacagc tgctgataac tttgatcaag atgcagcagt tggttttatc 114 0 

aaactctggg gacttccaac ccaagttaat gcacaagtaa acaaaggata a 1191 

SeqID 13 

atggctaaag tatgttattt cacaggacgt aaaacagttt ctggtaacaa ccgttcacac 60 

gcgatgaacc aaacaaaacg tactgttaaa ccaaaccttc aaaaagttac tgttcttatc 12 0 

gacggtaaac caaaaaaagt ttgggtttca gctcgtgcgc ttaaatctgg taaagtagaa 18 0 

cgcgtttaa 18 9 

SeqID 14 

atggcaaaac agaaaaataa ctggcgccgt gttggagttg gtgtccttac acttgcttca 60 

gttgcgactc ttgctgcatg tggaagtaaa tcagcttccc aggattctaa tggagcgatt 12 0 

aattgggcta ttccaacaga aatcaataca ctagatttat ctaaagttac agacacttac 18 0 

tcaaatctag ctattggtaa ctctagtagt aatttccttc gcttagataa ggatggaaag 24 0 

acgagaccag atttggctac taaagttgat gtttcaaaag atggtttaac ttatacagct 3 00 

acattacgta aaggcttgaa gtggtcagat ggcagtaaac ttactgcaaa ggattttgtt 3 60 

tattcatggc aacgtttagt tgatcctaag acagcttcac aatatgctta ccttgctgtt 42 0 

gaagggcatg tgcttaatgc cgataaaatc aacgaaggac aagagaaaga cttgaataag 48 0 

ctaggtgtta aggcagaagg cgatgacaaa gttgttatta ctttatctag tccgtctcca 54 0 

caattcatct actaccttgc attcactaat ttcatgccac aaaaacaaga agttgttgaa 60 0 
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aaatatggaa aagattacgc aactacttca aaaaatacag tttactcagg accatatact 6 60 

gttgaaggtt ggaatggttc aaatggfcact ttcaccctga agaaaaacaa aaattattgg 720 

gacgctaaaa atgtaaaaac aaaagaagtt cgcatccaga ctgttaaaaa accagatacc 780 

gccgttcaaa tgtataaacg tggtgaatta gatgcagcta atatctcaaa tacttctgct 840 

atttatcaag ctaataaaaa taataaagat gtcacagatg ttctagaagc gaccactgcc 900 

tatatgcaat ataatactac tggttctgtg aaagggcttg ataatgttaa gattcgtcgc 9 60 

gccttaaact tagcaactaa ccgtaaagga gttgttcaag cagccgttga tacaggctca 1020 

aaaccggcaa ttgcttttgc accgactggt ttagccaaaa caccagatgg aactgatttg 10 8 0 

gcaaaatatg ttgccccagg ttatgaatat aataaaactg aagcagcaaa actctttaaa 1140 

gaaggtttgg ctgaatcagg cttgactaag ctaaaattaa caattacagc agatgctgat 12 00 

gttectgctg ccaaaaactc tgttgactat atcaagtcta cttgggaagc tgctcttcca 1260 

ggacttactg ttgaagaaaa atttgtaacc tttaaacaac gcttagaaga cagtagaaaa 13 20 

caaaactttg atatcgtagt ttctgtttgg ggtggggatt atccagaagg gtcaactttc 13 8 0 

tacggcctct ttaagtcaga ttcacaaaat aacgatggaa aatttgctaa caaggactat 1440 

gacgctgctt ataacaaggc aatttccgaa gatgccttaa aaccagaaga atcagcaaag 1500 

gactataaag aagcagagaa aattctattt gagcaaggcg cttataaccc actctacttc 156 0 

cgtagtggta aaggtttaca aaatccaaaa ttaaaaggtg ttattcgtaa tactacaggt 1620 

ttgtcaatag actttacaca tgctta.taaa aaataa 1656 

SeqID 15 

atggaactct taaaaacacc catctttggt atttgctttt ctttaafcact ctatacgata 60 

ggacaacatt tatttaagaa gagtaaaggt ttcttccttt tgcagcctct tttctttgca 120 

atggttagtg gtattgtcat tctttggctt atgtcaaaag gtttaggaac cgatgttaag 18 0 

acattttata cacaagctta taaaccaggt ggcgatttaa tattttggtt tttaaatcca 240 

gcaacaattg cttttgcagt tcctctctat aagaaaaatg acgttgttaa aaaatattgg 3 00 

gtagaaattc tcagcagttt agtaatcggt atgattgttt cccttatgct tatcgtcgct 360 

atttctaaaa tggttgggct tagtcaagtc ggaattgctt caatgttgcc acaagcagca 420 

acaacagcaa ttgctcttcc aataacagca gcaattggag ggaacacagc tgtgacagca 48 0 

atggcgtgta tcttaaatgc agttattatt tatgcattag gtaaaaaatt agtgtcattt 540 

ttccatttga atgatagtaa gattggtgca ggattaggtc tagggacctc tggtcataca 600 

gtcggagcag cctttgcatt ggaattggga gaactgcaag gtgcgatggc agctatagcg 66 0 

gtggtggtta tcggtttggt agttgatttg gttattccta tctttagtca tttgattggt 72 0 

ttgctataa 729 

SeqID 16 

gtgactaaat atttgaagta catctctttt gttgctttgt ttctggcgag tatcttttta 60 

gtagcttgtc aaaatcaaaa ttcacaaaca aaggagcgaa cacgaaaaca acgacccaaa 12 0 

gatgaattgg ttgfcttctat gggggcaaag cttcctcatg aattcgatcc aaaggaccgt 18 0 

tatggaatcc ataatgaagg taatattact catagtaccc tattgaaacg ttctcctgaa 24 0 

ctagatataa aaggagagct tgctaaaaaa tataaaatct ctaaggatgg cttaacgtgg 300 

tcgttcgact taaatgatga ttttaaattc tccaatggtg agcctgttac tgctgacgat 3 60 

gttaagttta cttatgatat gttgaaagca gatggaaaag cttgggattt gacctttatt 42 0 

aagaatgttg aagtagttgg gaaaaaccag gtaaatattc atttgactga ggcgcattcg 480 

acatttacag cacagttgac agaaatccca atcgtcccta aaaaacatta caatgataag 54 0 

tataagagca atcctatcgg ttcaggacct tacatggtaa aagaatataa ggctggagaa 600 

caagcaatct tcgttcgaaa tccatattgg cacggtaaaa agccttattt taaaaagtgg 66 0 

acttgggtat tacttgatga aaatacagca ctagctgctt tagaatctgg tgatgttgac 72 0 

atgatttacg caacgccaga gcttgctagc aagaaagtca aaggaacacg tcttctagat 780 

attgcttcaa atgatgttcg tgggttatcg ctaccttatg tgaagaaagg tgttgtgaaa 84 0 

aattcaccag acggttatcc agtaggaaat gatgtcacta gtgatccagc aatcagaaaa 900 

gccttgacta ttggtttaaa tagacaaaaa gttctggata ctgttttaaa tggttatggt 960 

aaaccagctt attcaattat tgatagaaca ccattctgga atccaaaaac agcaattaaa 102 0 

gataataaag tagctaaagc taagcaactt ttgacaaaag ctggatggaa agaacaagca 1080 

gacggtagcc gcaaaaaagg aaatcttaaa gcggaatttg acctttacta ccctactaat 114 0 

gatcaattac gagcaaactt agccgttgaa gtagcggagc aagctaaagc cttaggtatt 12 0 0 

actattaaac tcaaggctag taactgggat gaaatggcaa ctaagtcaca tgattcagcc 126 0 

ttactttatg cgggaggacg tcatcatgcg cagcaatttt atgaafccaca ttacccaagt 132 0 

ttagctggta aaggttggac caatattact ttttataaca atcctactgt gactaagtac 13 8 0 

cttgacaaag caatgacatc tcccgacctt gataaagcta acaaatattg gaagttagct 144 0 

cagtgggatg gcaaaacagg tgcttccact cttggagatt taccaaatgt atggttggtg 150 0 

agtcttaacc atacttatat tggtgataaa cgtatcaatg taggtaaaca aggtgtccat 156 0 

agtcatggtc atgattggtc attattgact aacattgctg aatggacttg ggatgaatct 162 0 

gctaagfcaa 162 9 



SeqID 17 
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atgggccaag aacctatcat cgaatatcaa aatatcaata aagfcgtatgg ggaaaatgtt 60 

gcggttgaag atattaacct taaaatttac cctggtgatt tcgtttgttt catcggtacg 120 

agtggatcag gtaaaacaac attaatgcgt atggttaacc atatgttaaa accaacaaat 180 

ggtactctat tatttaaggg aaaagatatt tctactatta accccattga attaagacgc 240 

agaattggat atgttatcca aaacattggt ttaatgcctc atatgaccat ttacgaaaat 3 00 

atagttcttg taccaaaatt attgaaatgg tcagaagaag ctaaaagagc taaagcaagg 3 60 

gaacttatta aattagttga attacccgaa gaatatttgg atcgctaccc tagtgagttg 420 

tctggcggtc agcaacaacg tatcggtgtc attcgcgctc ttgcagcaga ccaagatatt 48 0 

attttaatgg atgagccttt tggagctctg gatcctatta ctagagaagg tattcaagac 540 

ttagtcaagt ctcttcagga agaaatgggg aaaactatca tcttagttac tcatgatatg 600 

gatgaagccc tcaagttagc aacaaaaatt attgttatgg acaatggtaa aatggtccaa 660 

gaagggacac ccaatgatct cttacatcat cctgctacca gtttcgttga acaaatgatt 720 

ggggaagagc gtcttcttca tgcgcaggct gatattaccc ctgttaaaca gataatgtta 780 

aataatcctg tttcaataac tgctgaaaaa acactaactg aagctattac actaatgcgc 840 

caaaaacgcg ttgactcact tctagtaacc gataacggta aattaattgg ttttattgac 900 

ttagaatctc taagcagtaa atataagaaa gaccgacttg tttctgatat cttaaaacat 960 

actgattttt atgttatgga agacgactta cttagaaata ctgctgagcg tattttaaaa 1020 

cgtggtttaa aatacgctcc agttgttgac catgagaata acctaaaggg cattgttact 1080 

cgtgcatccc tagttgatat gttatacgat attatttggg gcgatactga aacggaggat 1140 

caataa 1146 

SeqID 18 

atgaaaatag ataaaaaaga atttttagca cttattgcta gtattatttt attaattttt 60 

gcttctgtta cgtttttctt atttaaagat catggcacta ctcaaatgga tactgtagaa 120 

agcagtgtta accatgtcag tgattctcaa ttaactgagg cacaagatat gttagataaa 180 

tttgagaaga aaccttcaga aaaattattg aaggatgtcg aacttgcttt aaataaattg 240 

tccaattctt ctaaaaaaga agctttgcaa aaacgtttta agaaagcaaa agataagtat 3 00 

cttaaagacg aagcagataa gaaagctacc aaagatgcta cagatttagt agaaattcta 3 60 

gaacaagccc cttcagaaga aaatgtgtta aaagctgaag cagcagtcaa taagttaact 420 

gtcaaagaat ctaaagaagc tttacaaaaa cgtattgata ctgttaagac acaatatggt 480 

ttgattggga atcaaactcc ctctagttca gtagctgaaa ctacagaaca agggacagct 540 

aatcctgcta gtcaggatac ttctagttac gttaatcaga atgtagcacc aactfcatgag 600 

caaccgcaaa cgaafcaatac accagttact ccaggggtta acaatactgt tccgactcca 660 

ggaactggfca ctgcacctgc tactaatggg acgggtgttg ctcagtaa 708 

SeqID 19 

atgactaaag atttattatt ggagcttggc cttgaagagt tgccagctta tgttgtgaca 60 

ccgagtgaga agcaattggg ccaaaaaatg gtcaagtttt tagaggatca tcgtttatca 12 0 

tttgaaactg ttcaaacttt ttcaactcca cgtcgtttgg ctgttcgtgt taaaggattg 180 

gcagaccagc agacagattt aacagaagat ttcaagggtc cttctaagaa aattgctttg 240 

gatgcagaag ggaatttttc taaagcagcc caaggttttg ttcgcggaaa aggtttaagt 3 00 

gttgatgata tcgaatttcg tgaagttaag ggagaagaat atgtttatgt cactaagcat 360 

gaaactggaa aaagtgcgat tgatgtttta gcaagtgtga cagaggtatt aacagaattg 420 

actttcccag tgaacatgca ttgggcaaat aatagttttg aatatattcg tccagttcac 48 0 

actttagtgg tattattaga tgatcaagcc ttagagctag acttccttga tattcattca 540 

gggcgtatta gtcgagggca ccgtttcttg ggttctgaca ctgaaatttt atctgctagt 600 

tcgtatgaag atgacttgcg acaacaattt gttattgctg atgcaaaaga acgccagcaa 660 

atgatcgttg atcaaattca tgctatcgaa gaaaaggaaa atatttcagt fcgaaattgac 72 0 

gaagatttac taaatgaagt gcttaattta gttgagtatc cgaaagcatt cttgggttct 780 

tttgacgaga aataccttga tgtacctgaa gaagttttag tcacttcaat gaaaaatcat 840 

caacgctatt ttgttgttcg cgacagggat ggaaaattgt taccaaactt tatttctgtg 900 

cgtaatggaa atgctgagca tatcgaaaat gttataaaag gtaatgaaaa agttttagfcg 960 

gctcgtctcg aagatggtga attcttctgg caagaagatc agaaacttaa tattgcagat 102 0 

ttagttgaga aattaaaaca agtaactttc catgaaaaaa ttggttcact ttatgaacat 1080 

atggatcgtg taaaagttat ttcacaatat ttggctgaga aggcagatct ttctgatgaa 1140 

gaaaaattag ctgtgcttcg agctgcaagt atttataaat ttgacctctt aactggaatg 12 00 

gttgatgagt ttgatgaatt acaaggtatt atgggtgaga aatatgcctt acttgcaggg 12 60 

gaacagcccg cagtagcagc agcgatacgt gaacattata tgccgacatc ggccgatgga 13 2 0 

gagcttccag aaaccagagt tggtgcaatt ctagcgttag ctgataaatt tgacacctta 13 80 

ctttcattct tctcagttgg tttgattcca agcggttcaa atgatccata tgctcttcgt 1440 

cgagctactc aggggattgt ccgtattttg gaagcttttg gttgggatat cccacttgat 15 0 0 

gaattagtaa ctaaccttta tggtctatcg tttgctagtt tagattatgc taatcaaaaa 1560 

gaggtcatgg cctttatttc agctcgtatt gaaaagatga ttggctctaa ggtccctaaa 162 0 

gatattcgag aagccgtttt agaatcagac acttatattg ttagccttat tctagaagct 168 0 

agccaagcat tggtgcaaaa gagcaaggat gctcaatata aagtttccat tgaaagtttg 174 0 
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tcacgtgcat ttaatttagc tgaaaaagtt actcattctg tatcggtgga ctattcattg 18 0 0 

tttgaaaata atcaggagaa ggcgctatat caagctattt tgtcacttga actgactgag 1860 

gatatgcatg acaacttaga taagcttttc gcacttagtc caattattaa tgatttcttt 1920 

gataatacaa tggtaatgac agatgatgaa aaaatgaaac aaaatcgtct ggctttactc 198 0 

aatagccttg ttgcaaaagc aagaacagtc gctgctttta acttattaaa cacaaaataa 2 04 0 

SeqID 2 0 

atgacatttg acaccattga tcaattagcg gttaatacag tccgcacgct ttctattgat 60 

gctatccaag cagcaaattc tgggcaccca ggtcttccta tgggagctgc gcctatggct 120 

tatgtgcttt ggaataaatt cttaaatgta aacccaaaaa caagtcgcaa ttggacaaac 180 

cgtgaccgtt ttgtgctatc agctgggcat ggttcagctc ttctttatag cctacttcat 240 

ttagctggct atgatttatc aattgatgat ttaaaacaat tccgccaatg gggatcaaaa 3 00 

acacctggac atccagaagt taaccatact gatggtgtag aagcaacaac tggaccttta 3 60 

gggcaaggta ttgctaatgc tgttggtatg gcaatggctg aagctcattt ggctgctaaa 420 

tttaataaac caggctttga tttagttgat cattacactt atacacttca tggtgatggt 480 

tgtttgatgg agggtgttag ccaagaagct gccagtcttg ctggacactt aaaacttggt 540 

aaattggtcc ttctttatga ttcaaatgat atctcacttg atggtccaac gtctcaatca 600 

tttacagaag atgttaaagg gcgttttgaa tcatatggtt ggcaacatat cttagttaaa 660 

gatggtaatg atttagaagc gattgctgca gcaattgaag ctgctaaagc tgaaactgat 720 

aaaccaacca ttattgaagt taaaactatt attggatttg gcgcagagaa acagggaaca 780 

tcttcagttc atggagcgcc acttggtgca gaaggtatta ctfcttgccaa gaaggcttat 840 

ggatgggaat atccagactt tactgttcct gctgaagttg tagctcgatt tgcctcagat 900 

cttcaagcgc gtggtgcgaa ggcagaagag gcatggaatg atctttttgc taaatatgaa 960 

gttgagtatc cagaattagc agctgaatac aaagaagctt ttgctggcca agctgagaca 1020 

gttgaattga aagctcatga cctaggttca tcagtggcta gtcgtgtttc aagccaacaa 1080 

gctattoaac aattatcgac tcaattaccg aacctctggg gagggtcagc agacctctca 114 0 

gcttctaata acacaatggt agcagcagaa acagatttcc aagcaagtaa ttacgctggg 120 0 

cgtaacattt ggtttggfcgt acgtgaattt gcaatggctg ctgcaatgaa tggaattgcg 1260 

cttcacggtg ggactcgtgt ttatgggggt acattctttg tcttctcaaa ctacttgtta 132 0 

ccagcagtac gtatggctgc acttcaaaac ttgccaactg tctatgttat gacacatgat 13 8 0 

tctattgctg ttggagaaga tgggccaact catgagccaa tagaacaatt agcatcagtg 1440 

cgctcaatgc ctaatttgaa cgttatccgt cccgcagacg gtaatgaaac aaacgctgca 1500 

tggcaacgag ctgtatcaga gacagatcga ccaacgatgc ttgttttaac acgtcaaaac 1560 

ttaccagttc ttgaaggaac atcagaatta gcccaagaag gggttaataa aggtgcttat 162 0 

attttatcag aagctaaagg tgaacttgat ggtatcatca ttgcaacagg ttcagaagtt 168 0 

aaattagctt tagatactca agacaaactt gagtcagagg gcatccatgt tcgcgtcgta 174 0 

tctatgccag cacaaaacat ttttgatgaa caagaggctt catatcaaga gcaagtgctt 18 0 0 

ccatcagctg ttacaaaacg tcttgctatt gaggcaggat caagcttcgg ttggggtaaa 186 0 

tatgttggtt taaatgggct aacacttact attgatactt ggggagcatc agcaccaggt 192 0 

aacagaattt tcgaagagta tggctttaca gttgagaatg ctgtatcgct atataaagaa 198 0 

ctttaa 1986 

SeqID 21 

atgactttgc aagatcaaat tatcaaagaa ctcggtgtaa aacctgttat taatccaagt 60 

caagaaatcc gtcgttccgt agaattttta aaagactatc tattaaaaca tagttttcta 12 0 

aagacctatg tactgggtat ttcagggggg caagattcaa ctttagcagg acgtctagct 18 0 

caactagcag ttgaagagtt gcgtgctgat acaggggaaa actatcaatt tatagctatt 24 0 

cgtttaccat atggtattca ggccgatgaa gaagatgcac aaaaagcatt agattttatc 3 00 

aagccagata ttgctttgac cattaatatt aaagaagctg ttgatggcca agttagagct 3 60 

cttaatgcag caggagtaga gattacagac tttaataaag gaaatattaa agctcgtcaa 42 0 

agaatgatta gtcaatatgc agttgcaggt caatatgctg gagcggttat tggtactgat 48 0 

catgctgcag aaaatattac aggttttttc accaaatttg gtgatggagg tgctgactta 54 0 

ttacctctct ttagacttaa taaaagtcag gggaaacagt tattagctga acttggtgct 60 0 

gataaagcct tgtacgaaaa gattccaacc gcagatttag aagaaaataa accaggaatc 660 

gctgatgaaa tagcccttgg agtaacctac caagaaattg atgcttatct tgaagggaag 72 0 

gtggtttcgg acaagtctcg aggtatcatt gaaaattggt ggtataaggg tcaacataaa 78 0 

cgtcatcttc caatcacaat ttttgatgat ttctggaaat aa 822 

SeqID 22 

attaaaaagg aatctgttat caaactattg aagtatgctt ttggcattat aatgggattt 60 

attatcttag ctattgtaat aggtgggctc ctatttgcat actacgttag tcgttctccg 12 0 

aaattaaccg atcaagcttt aaaatccgtt aactctagtt tggtttatga tggtaataat 18 0 

aaacttattg ccgatttagg ctcagaaaag cgtgaaagtg ttagtgcgga tagcattcca 24 0 

ctaaatttgg ttaacgctat cacttctatc gaagataaac gtttctttaa acatagaggt 3 00 

gtcgatattt atcgtatttt aggtgcagct tggcataatc ttgttagtag taatacgcaa 360 
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ggtggttcaa cccttgatca acagttgatt aaactggctt acttttctac caataaatct 420 

gaccaaacgt taaaacgtaa atcacaggaa gtttggcttg cgcttcaaat ggagcgtaaa 4 80 

tacaccaaag aagaaattct tactttctat attaataaag tttatatggg aaatgggaat 540 

tatggtatga gaacaacagc taaatcatac tttggtaaag acctaaagga attatctatt 6 00 

gcacaacttg ctttgctcgc tggtattcct caagcaccta cacaatatga cccttataaa 660 

aacccagaat ctgctcaaac aagacgtaat accgttcttc agcagatgta tcaagataaa 72 0 

aacatttcta aaaaggaata cgaccaagct gttgcaactc cagtaactga tggcttaaaa 78 0 

gaattaaagc aaaaatctac ttatccaaaa tatatggata actacttaaa acaagttatt 84 0 

agtgaagtta aacaaaaaac tggtaaagat atctttactg ctgggctaaa agtgtatact 900 

aatatcaaca ctgatgcaca aaaacaacta tatgacatct acaacagtga cacttacatc 9 60 

gcttatccaa acaa.tgaatt acaaatagca tctaccatca tggatgcgac taatggtaaa 102 0 

gtcattgcac aattaggcgg gcgtcatcag aatgaaaata tttcatttgg gacaaatcaa 108 0 

tctgtcttaa cagaccgcga ttggggttct acaatgaaac ctatctcagc ttatgcacct 114 0 

gctattgata gtggtgtcta taattcaaca ggtcaatcat taaacgactc agtttactac 12 0 0 

tggcctggta cttctactca actatatgac tgggatcgtc aatatatggg ttggatgagt 12 60 

atgcagaccg ctattcaaca atcacgtaac gtccctgctg tcagagcact tgaagccgct 13 2 0 

ggattagacg aagcaaaatc tttccttgaa aaattaggca tatactatcc agaaatgaac 13 8 0 

tattcaaatg ctatttcaag taacaacagt agcagtgatg caaaatatgg tgcaagtagt 144 0 

gagaaaatgg cagcggctta ctcggctttt gcaaacggcg gaacttacta taaaccgcaa 15 0 0 

tatgttaata aaattgaatt tagcgatgga accaatgata cttatgcagc gtctggtagc 15 6 0 

cgtgcgatga aagagactac tgcctacatg atgacggata tgctgaaaac agtactaaca 1620 

tttggtactg gtactaaagc agctatccct ggtgttgcac aagctggtaa gactggtact 168 0 

tccaactata cggaagatga gttagctaaa attgaagcaa ctactggtat ctacaatagc 174 0 

gccgttggta caatggctcc tgatgaaaac tttgtcggct atacttctaa gtacacaatg 18 00 

gcaatttgga ctggttataa aaatcgcctt acaccacttt atggtagcca actggafcatt 1860 

gctactgagg tttatcgtgc aatgatgtcc tacctaactg gcggatacag tgcagattgg 192 0 

acaatgccag aagggctcta tcgaagcgga agctacttgt atatcaatgg gacaactaca 198 0 

accggcactt actcttcttc tgtctataag aatatctatc aaaattctgg acagtcaagt 204 0 

caaagcagta gttcaacatc ctctgagaaa caaaaagagg ataagaatac agcaaatgat 210 0 

gccaattcat catccccaca agtcgaaaca cctaataatg gtaatgctac tacccctaat 2160 

aattccaatc aaactgtgcc aggaactgga catggtaatg gtaatggtaa caacaataca 222 0 

gtacctaacg gtaattaa 223 8 

SeqID 23 

atgttagact taaaacgtat tcgtacagat tttgatgttg tcgctaaaaa attggccacg 60 

cgtggtgttg accaagaaac attaacaacc cttaaagaac ttgatataaa acgccgtgaa 12 0 

ctacttatca aagctgaaga ggcgaaagcg caacgtaatg ttgcttctgc tgccattgct 18 0 

caagcaaaac gaaataaaga gaatgcagac gagcaaatcg cagctatgca aacactttcc 24 0 

gctgatatca aagcaattga tgcggaattg gcagatgtgg atgctaattt acaatcaatg 3 00 

gtgaccgtac ttcctaatac accagctgat gatgtgccgc ttggagcaga cgaggatgaa 360 

aatgttgaag ttcgtcgctg gggaactcct cgtgaatttg attttgagac caaagctcac 420 

tgggatttag gtgaaagcct tggaattctc gattgggaac gtggtgcaaa agttactggc 48 0 

tctcgcttcc tcttctacaa aggtctaggt gctcgtttag agcgtgctat ctacagcttc 54 0 

atgctggatg aacacgctaa ggaggggtac actgaagtga tccctcctta tatggttaac 60 0 

cacgattcaa tgttcggtac gggacaatat ccaaagttca aggaagatac tttcgaatta 66 0 

gcagatagcc cattcgtact tatcccaaca gctgaagttc ctttaacaaa ctattatcgt 720 

gatgaaatta tcgatgggaa agaattacct atatacttta ctgccatgag tccatcattc 7 80 

cgttcagaag caggctctgc aggacgtgac acacgtggat taattcgcct tcatcaattc 840 

cataaagtag aaatggtgaa atttgccaag cctgaggaat cataccaaga attagaaaaa 9 00 

atgacagcaa atgctgaaaa tattcttcaa aaacttaatc tcccataccg tgtcattacc 9 60 

ttgtgtacag gagatatggg attctcagca gctaaaacgt atgacttaga ggtafcggatt 1020 

cccgctcaaa atacttatcg agaaatttct tcatgttcta atactgaaga tttccaagca 10 80 

cgtcgtgctc aaattcgtta ccgtgacgaa gttgatggca aagttaggtt gcttcatact 1140 

ttaaatggtt caggacttgc agttggacgt actgtagcag caatcctcga aaattatcaa 12 00 

aatgaagatg gttctgtgac tatcccagaa gttcttcgtc catatatggg taatatcgat 12 60 

attattaaac caaattag 1278 

SeqID 24 

gtgacaatat caaatcaaga attgacgctt actcccttaa gaggaaaaag tggcaaagcc 60 

tatattggta cttatccaaa tggtgaacgt gtctttgtca aatataatac gacacctatt 12 0 

ttgccagcct tagctaagga acagattgcg ccacaactac tgtgggcacg ccgtacaagt 18 0 

aatggcgata tgatgagtgc tcaagaatgg ttagatggtc gaacattgac taaagaagat 240 

atgggtagta aacagattat tcatatttta ttgcgcctgc acaagtctcg tcccttagta 3 00 

aatcagctgt tacaattagg gtataaaatt gaaaaccctt atgatttgtt gatggattgg 3 60 

gaaaaacaaa ctcctattca aatccgggaa aacacttatt tgcaatctat tgtgactgag 420 
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ttaaagcgta gtttacctga gtttcgaaca gaggttgcaa caatcgttca cggagatatc 480 

aaacatagca actgggttat tacaacaagt ggtttaattt atttggttga ttgggattct 54 0 

gttcgtttaa cagatcgcat gtatgacgtt gcatatattt taagtcatta tattccacaa 600 

aaacactgga aagattggct gtcttactac ggttataaag ataatgagaa agtctggagc 660 

aaaattatct ggtatgggca attttcatat ttatcacaga taatcaaatg ttttgataaa 720 

cgagatatgg agcatgtcaa tcaagaaatt tatgaattac gaaagtttag agagttaatt 780 

aaaaagcata atgcgagtta g 8 01 



SeqID 25 

atgaaaatat cacaatacaa taaatggtca attcgccgat tgaaagttgg agctgcatct 6 0 

gtaatgattg ctagtggaag cattgttgct ttagggcaat ctcatattgt ttcagcagat 12 0 

gagatgtccc aacctaaaac aaccattaca gctccaacag ctaacacttc tacaaatgta 180 

gaaagtagta ctgataaggc attatcaaaa gttactacga tggagacaag ttcagaaatg 240 

cctaaaatgc aaaatatggc aaaagttgaa aaaacgagtg acaaacctat gatggtggct 300 

acctctgtta gaaaaatgat ggcaacgccg acaccfcgtag cgatgaccaa aacgactagt 3 60 

gtagatgagg taaagaaatc aactgatact gcttttaaac agacagtaga cgttccggct 42 0 

cattatgtta atgcagcaaa agggaatggc ccatttctag caggtgtaaa tcaaacgatt 48 0 

ccatatgaag cgtttggtgg agatggcatg ttaacgcgtfc taatccttaa atcttctgaa 54 0 

ggagcaaaat ggtcagataa tggtgttgat aaaaatagtc cattattacc attgaaaggc 600 

ttaactaaag gtaaatattt ctatcaagtt tctttaaatg gtaatacgac aggtaaagag 66 0 

ggacaagctc tcttagatca aattaaagct aatgataaac atagctacca agccacaatt 720 

agagtttatg gtgctaaaga tggtaaagtt gatttaaaaa acatgattag ccaaaaaatg 78 0 

gtaaccatta acattcctca cattacaact gatatggaag taaaaaattc tctaaagatg 84 0 

gcctttaaag aaaaagtaga cgtcccagca aaatatgtta gtgctgcaaa agctaaaggt 900 

ccatttttag caggtgttaa cgaaacgatt ccatatgaag cgtttggtgg ggatggcatg 960 

ttaacgcgtt taatccttaa agcttcagag ggagcaaaat ggtcagacaa tggtgttgat 102 0 

aaaaatagtc cattattacc gttaaaagat ttaactaagg gtaaatactt ctaccaagtc 108 0 

tctcttaatg gcaatacagc cggtaaaaaa ggccaagctc tcttagatca aattaaggca 114 0 

aatggaagcc atacttatca agcaactatc actatttatg gtacgaaaga tggcaaagtt 120 0 

gatatgaata ctattttagg acaaaagaca gtgatgattc atattaatgt cgctaaaaaa 126 0 

gatatgaata gcacatcaat gatgatgaaa aaagataaga tgacaatgcc tatgaagaaa 132 0 

gaaatgactt catctaaaat aaataccggt atgatgatgt ctaacaacaa gatgtcagct 13 8 0 

aatatgcaaa tgtcatcaca agccaaatca aatgataaag ctggtaaaaa aatgtctatg 144 0 

atgtctaaaa atttacctaa tacaggtgaa acaaagcaac aaaatgtagg tgtgctaggt 150 0 

atgcttagtc tagcctttgc gacaggtcta actgcgctag gccttaaaaa atctaaacaa 156 0 

agatag 1566 



SeqID 26 

atgacacata ttacatttga ctattcaaaa gtcttgggtc aatttgtagg cgaacacgag 6 0 

ttagactacc tacaaccaca agtaagtgca gcagatgctt tccttcgtca agggactggt 12 0 
cctggctcag attttctcgg atggatggac ctcccagaaa actatgacaa agaagaattt 18 0 
tctcgcattc aaaaagccgc tgaaaagatt aaatcagata gcgaagtact cgtggttatt 240 
ggtattggtg gttcgtacct tggcgcaaaa gcagcaattg actttttgaa taatcatttt 3 00 
gctaatttgc aaaccgcaga agaacgtaaa gcgcctcaga ttctttatgc tggaaattct 3 60 
at t teat eta cttaccttgc cgatttagtt gaataegtec aagataaaga attctcagta 420 
aatgtcattt caaaatcagg tacaacaact gaaccagega ttgctttccg tgtatttaaa 480 
gaacttctag ttaaaaagta eggtcaagaa gaagctaata aacgtatcta cgctacaact 540 
gataaagtta aaggtgccgt aaaagttgaa geagatgeta acaattggga aacatttgta 600 
gttcctgata atgttggtgg tegtttctea gtactgacag cegtaggtet tcttccaatt 660 
getgettcag gagcagatat tactgetett atggaaggcg caaatgetge tegtaaagat 720 
ttatcttcag ataaaatctc tgaaaacatt gcttatcagt aegctgeagt tcgtaatgtt 780 
etttatcgea aaggttatat cacagagatt ttagctaact acgaaccatc attacaatac 840 
tttggtgaat ggtggaaaca attagcaggt gaatcagaag gtaaagatca aaaaggtatc 900 
tatcctacat cagcaaactt ctcaactgat cttcactcac ttggtcaatt tattcaagaa 960 

ggttaccgta acctctttga gacagttgtt cgcgttgaga aacctegtaa aaatgtgact 1020 

attcctgaat taaccgaaga tttagatggt cttggttatc ttcaaggaaa agatgtagat 10 80 

tttgttaata aaaaagcaac agatggtgta ctccttgctc atacagatgg tggggttcca 1140 

aatatgtttg taacgcttcc tacacaagac gcttacaccc ttggttacac tatttacttc 12 00 

tttgagttag caattggect ttcaggttat cttaactcag taaatccatt tgatcaacca 1260 

ggggtagaag catataaacg taatatgttt gctcttctag gtaaacctgg attcgaagag 1320 

ettagegctg aattgaatgc aegtctttaa 1350 

SeqID 27 

atgaaagaaa aacaaacagc eggaegtegt caattagaag aatttgeace tgaatttgee 60 
agatataatg atgatatttt atttggtgag gtttgggcaa aagaggatca tttaacagat 120 
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aaaacacgtt etattattac tafcatctgcc 
gagcatcatc ttcaatttgc caaacaaaac 
atcactcacc ttgcttttta tgttggatgg 
aaggaaattt ggatataa 



ttaattagcg gaggaaatct tgaacagtta 180 
ggagttacta aagaagaaat tgctgacatt 240 
ccaaaagctt ggtcagcctt taataaagct 3 00 

318 



SeqID 2 8 

gcttggagag ctttagagga tgctcaaaaa tctggtaagg taaaatccat cggtgtctca 60 

aactttttag aaaaagactt agaaaatatc ttaaaaaatg gtcatgttaa acctgctgtc 120 

aatcaaatct tagctcacat cggaaataca cctttcgact tgattgatta ctgtcaaagt 180 

aaaggtattc aagttgaagc ttatt caeca attgeacatg gacaagcatt gaaatctgat 240 

ggtattcaaa aaatggctga aaaatatggg gttagtgttg ctcaattatg tattcaatat 3 00 

ttactacaat taaacctgat cgttttacca aaggcttcta gtaaagaaca tttacagtct 3 60 

aatctagact ttgactttgt aattagtgac gaagatatgt cgatattaaa atcacttatg 420 

tfctgacgatt atggggaatt ttctaacttc cctgttttta gtgaaaaatg a 471 

SeqID 2 9 

atgtttagaa ggtctaaaaa taacagttat gatacttcac agacgaaaca aeggttttea 60 

attaagaagt tcaagtttgg tgeagcttet gtactaattg gtcttagttt tttgggtggg 120 

gttacacaag gtaatcttaa tatttttgaa gagtcaatag ttgetgeate tacaattcca 18 0 

gggagtgcag cgaccttaaa tacaagcatc actaaaaata tacaaaaegg aaaegcttae 240 

afcagatttat atgatgtaaa gaatggattg attgatcctc aaaacctcat tgtattaaat 3 00 

ccatcaagct attcagcaaa ttattatatc aaacaaggtg ctaaatatta tagtaatccg 3 60 

agtgaaatta caacaactgg ttcagcaact attactfctta atatacttga tgaaactgga 420 

aatccacata aaaaagctga tggacaaatt gatatagtta gtgtgaattt aactatatat 480 

gattctacag ctttaagaaa taggatagat gaagtaataa ataatgeaaa tgatcctaag 540 

tggagtgatg ggagtcgtga tgaagtctta actggattag aaaaaataaa aaaagatatt 600 

gataataatc caaaaacaca aatagatatt gataataaaa ttaatgaagt caatgaaata 660 

gagaaattgt tagttgtatc gctaccagat aaaattaagt attcaccaga ggctaagcat 720 

aggactgttg aacaacacgc ggaattagat gcaaaagata geattgeaaa tacagatgaa 780 

ttgccatcaa attcaaegta taactggaaa aafcggtcata aaccagacac ctcaacatca 840 

ggtgaaaaag aeggaattgt tgaagttcac tatccagatg gtactgttga tgatgtgaat 900 

gttaaagtaa ccgttacatc gaaaaaaact gataatacag ctccaacatt aaccgtcact 960 

ccagagcaac agactgttaa agtggatgaa gatattacct ttacggttac agctgaagac 1020 

gaaaatgaag ttgaactagg tttagafcgat cttaaagcta agtatgaaaa tgatatcatt 1080 

ggagctcgtg ttaaaattaa gtatcttact aaagaaccta ataagaaagt catggaagtg 1140 

acaattatga aagctacttt agcagataag ggegcaatta ccttfcactgc aaaagataaa 12 00 

gcaggtaatc aagcagaacc taagacagtt accatcaatg ttcttccgcc agataaaatt 12 60 

aagtattcac cagaggctaa gcataggact gttgaacaac aegeggaatt agatgeaaaa 1320 

gatagcattg caaatacaga tgaattgeca tcaaattcaa cgtataactg gaaaaatggt 13 80 

cataaaccag acacctcaac atcaggtgaa aaagacggaa ttgttgaagt tcactatcca 1440 

gatggtactg ttgatgatgt gaatgttaaa gtaaccgtta catcgaaaaa aactgataat 15 00 

acagctccaa cattaaccgt cactccagag caacagactg ttaaagtgga tgaagatafct 1560 

acctttaegg ttacagctga agacgaaaat gaagttgaac taggtttaga tgatcttaaa 1620 

gctaagtatg aaaatgatat cattggagct cgtgttaaaa ttaagtatct tactaaagaa 1680 

cctaataaga aagtcatgga agtgacaatt atgaaagcta ctttagcaga taagggegea 1740 

attaccttta ctgeaaaaga taaagcaggt aatcaagcag aacctaagac agttaccatc 18 00 

aatgttcttc cgccagataa aattaagfcat tcaccagagg ctaagcatag gactgttgaa 18 60 

caacacgcgg aattagatgc aaaagatagc attgeaaata cagatgaatt gecatcaaat 192 0 

teaaegtata actggaaaaa tggtcataaa ccagatacct caacaccagg agagaaaaac 198 0 

gctgttgttg ttgtgaccta tccagataag tcaacagatg aagttcctgt aaaagttaca 204 0 

gtggttgacc caegtacaga cgcagagaaa aatgatccag caggtaaaga tcaaactgtc 2100 

aaagtgggtg aacaaccaga tccgacaaaa tctcttgaag cagtaccagc tggaagtaca 2160 

gtagegtaca aagaaccagt tgatactaag acaccaggag agaaaaaege tattgttgtt 222 0 

gtgacctatc cagataagtc aacagatgaa gttcctgtaa aagttacagt ggttgaccca 22 8 0 

cgtacagacg cagagaaaaa tgatccagca ggtaaagatc aaactgtcaa agtgggtgaa 234 0 

caaccagatc cgacaaaatc tcttgaagca gtaccagctg gaagtacagt agegtacaaa 24 0 0 

gaaccagttg atactaagac accaggagag aaaaaegcta ttgttgttgt gacctatcca 246 0 

gataagtcaa cagatgaagt tcctgtaaaa gttacagtgg ttgacccacg tacagacgea 252 0 

gagaaaaatg atccagcagg taaagatcaa actgtcaaag tgggtgaaca accagatccg 258 0 

acaaaatctc ttgaagcagt accagctgga agtacagtag cgtacaaaga accagttgat 2640 

actaagacac caggagagaa aaacgctgtt gttgttgtga cctatccaga taagtcaaca 270 0 

gatgaagttc ctgtaaaagt tacagtggtt gacccacgta cagaegcaga gaaaaatgat 276 0 

ccagcaggta aagatcaaac tgtcaaagtg ggtgaacaac cagatccgac aaaatctctt 282 0 

gaagcagtac cagctggaag tacagtagcg tacaaagaac cagttgatac taagacacca 288 0 

ggagagaaaa acgctgttgt tgttgtgacc tatccagata agtcaacaga tgaagttcct 2940 
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gtaaaagtta cagtggttga cccacgtaca gacgcagaga aaaatgatcc agcaggtggg 3 000 

gaaacaacag taccacaagg cacaccgatc agtgatgaag aaattacagg attagtgaag 3 0 60 

attccagaag gatcaaacgg cgtaccgaaa gtagtaggag accgtccgaa tacagacgta 3120 

ccaggagatt ataaagtaac ggtagaagta acgtatccag atggaacaaa ggatacagta 318 0 

gcagtaacgg ttcatgtgac accaaaacca gtaccggata aagataaata tgatccaaca 3 240 

ggtaaatctc agcaagtcaa cggtaaagga aataaactac cagcaacagg tgagagtgca 3 3 00 

actccattct ttaatgttgc agctttgaca attatatcat cagttggttt attatctgtt 3 360 

tctaagaaaa aagaggatta a 3 3 81 



SeqID 3 0 

atgcagattc tcgaagacta tgatggtaga gcattaccca agctagaaac ggatcgactt 6 0 

atcttgcgtc agcgaacggt gggggatgtc ccagccatgt ttgattacgt ttgcttggaa 12 0 

gaggfctgcct atccagcagg gctcagccca atagcatctt tagaagatga gtatgattat 18 0 

tttgaaaatc gctactatca aaatctggaa aaggcaaaat taccatctgg ttatggtatt 24 0 

acggttaaag gaagtgaccg gattatcgga tcctgcgcct ttaaccatcg ccgtgaagac 300 

gatgttttcg agattggcta tctgcttcat cctgactact ggggtcatgg ctatatgaca 3 60 

gaagctgtcg ccgctttgat tgaagtcggt tttacccttc tcaatctcca caaaatcgaa 42 0 

atccgctgct atgactacaa taagcaaagc cagcgcgtgg cagaaaaact aggattcacc 48 0 

cttgaagcaa ccatccgtga tcgtaaggat aatcaaggca atcgctgtgt caatttgata 54 0 

tatggtttgc tgaggagtga gtgggagtaa 570 



SeqID 31 

atgtcaaaag aactatcacc aaaatacaat cctgccgagg ttgaggaagg ccgttatcaa 6 0 

acttggcttg accaagatgt tttcaagcca tcaggggata ctgaggctaa gccctattct 120 

atcgtgattc caccaccaaa cgttacgggt aaattacact tgggtcacgc ttgggataca 18 0 

actctacaag atatcattat tcgtcaaaag cgtatgcagg gctttgatac actctggctt 24 0 

ccaggtatgg accacgctgg tatagccact caagctaagg ttgaggagcg ccttcgtgag 3 00 

caaggtattt cacgttatga tcttggtcgt gaaaaattcc tagataaagt ttgggaatgg 3 60 

aaagacgagt atgcagcaac tattaaatct cagtggggga aaatgggtct ttcggttgat 42 0 

tatagccgtg agcgttttac ccttgatgag ggactttcaa aagcggtaag aaaagtcttt 48 0 

gttgatttgt acaataaagg ttggatctat cgtggggaat tcatcattaa ctgggatcca 540 

gcagctcgta cagccctttc agatattgaa gttattcaca aggatgttga gggtgctttc 60 0 

tatcatatga attatatgct ggaagatggc tcacgcgccc tagaagtagc aacaactcgt 66 0 

cctgagacta tgtttggaga tgtggctgta gcggtaaatc cagaagatcc acgttacaaa 72 0 

gatttgattg gtcaaaatgt tatccttcct atcatcaata aacctattcc aatcatcgct 78 0 

gatgaacacg ctgatccaga atttggaaca ggtgtggtta agattacgcc ggctcacgac 84 0 

cctaatgact tcgctgtagg tcaacgccac aacttaccac aagttaacgt tatgaacgat 90 0 

gatggtacta tgaatgaact tgcggatgaa tttaatggta tggatcgttt cgaagctcgt 9 60 

aaggcagtag tcgctaaatt agaatcgtta ggaaacttag taaaaattga aaaaatgact 102 0 

cattctgttg gtcattcaga acgaacaggc gttgtggttg aaccacgttt atctactcaa 10 8 0 

fcggttcgtca agafcggatca attggctaag aatgccattg ccaaccaaga tacagaagac 114 0 

aaggtagagt tttatccacc acgatttaac gatactttta tgagctggat ggaaaatgtc 12 0 0 

catgactggg ttatctctcg tcaattatgg tggggacacc aaattcctgc atggtacaac 1260 

gtcaatggtg agatgtatgt tggagaagat gccccagagg gtgacggatg gacacaggat 132 0 

gaagacgttc tagacacatg gtttagttct gctttatggc cgttttcaac aatgggatgg 13 8 0 

ccagacaccg aagcagctga cttcaaacgc tacttcccaa catcaacctt ggtaactgga 144 0 

tacgatatca tcttcttctg ggtatcgcgc atgattttcc aatctcttga atttacagga 15 0 0 

cgccaaccat tctcaaatgt tcttattcat ggactcatcc gtgatgaaga aggacgtaaa 15 60 

atgtctaaat cacttggcaa cgggattgat ccgatggatg tcattgagaa atatggtgcg 162 0 

gatgcccttc gttggttcct ctcaaatggt tcagcccctg gtcaagatgt gcgtttttca 168 0 

tacgaaaaaa tggatgcttc gtggaatttc attaataaga tfctggaacat ttcacgctat 174 0 

atccttatga ataacgaagg tttaacgttg gatcaagcca gagaaaatgt ggaaaaagtt 18 00 

gttaatagtc aagttggaaa tgtaacagac cgttggatfcc tccacaattt aaatgagact 18 6 0 

gtgggtaaag ttactgaaag ctttgacaag tttgaatttg gtgttgctgg tcacatcctt 192 0 

tacaacttta tctgggaaga atttgctaat tggtacgttg agttaacaaa agaggttctt 198 0 

tatagtgaca atgaagatga gaaagttgtt acacgttctg ttcttcttta taccttagat 2 04 0 

caaatcttgc gtcttcttca tccaatcatg ccatttgtga cagaagaaat ttttggacaa 210 0 

tacgcagaag gatcaattgt ccttgcaagt tatccacaag ttaacgcgac atttgaaaat 216 0 

cagacagctc acaaaggcgt tgaaagcttg aaagatttga ttcgttcagt gcgaaatagc 222 0 

agagcagagg tcaatgttgc tccatctaaa ccaatcacta tcttggttaa gacaagtgac 22 8 0 

agcgaactcg aaagcttctt caaagacaat agtaactaca tcaaacgctt cacaaaccca 234 0 

gagacgcttg aaatcagctc agcgattact gccccagaac ttgccatgac tagcattatc 24 0 0 

acaggtgcag aaattttcct cccacttgct gacctcctta acgttgaaga agagttggca 246 0 

cgacttgaaa aagaacttgc taaatggcaa aaagaactaa atatggtcgg caaaaaactc 252 0 

agcaacgagc gctttgtagc taacgctaaa ccagaagtcg tccaaaaaga aaaagacaaa 258 0 
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cagaccgact accaaactaa atacgacgca actatagcac gtattgaaga gatgaaaaaa 2640 

ttaaataatg attga 2655 

SeqID 3 2 

atggttgagc caattatttc aatacaagga cttcataaaa gttttgggaa aaatgaggtt 60 

ttaaaaggca ttgacttgga tattcatcaa ggagaagtgg tggttattat tggcccttct 120 

ggctctggta agtcaacatt tttaagaaca atgaatctct tggaagtacc aacaaaggga 18 0 

acagtgactt ttgaagggat tgatataaca gacaaaaaga atgatatttt taaaatgcgc 24 0 

gaaaaaatgg gcatggtttt tcaacagttc aatctatttc ccaatatgac tgtactagaa 3 00 

aatattactt tatcacctat taagacaaag ggactttcta agcttgatgc tcagacaaaa 360 

gcatacgagc tacttgaaaa agttggactc aaagagaagg ctaatgctta tccagctagc 42 0 

ttatctggag ggcaacaaca acgaattgct attgcaagag gtcttgcaat gaatcctgat 48 0 

gtccttcttt ttgatgaacc tacttcagct cttgatcctg aaatggtagg tgaagtcttg 540 

actgttatgc aagatttagc taaatctggt atgacgatgg ttattgtcac tcatgaaatg 600 

ggttttgcac gtgaagtagc ggatcgtgtc atttttatgg atgcaggcat tattgttgag 660 

caagggaccc ctaaggaagt atttgagcag acaaaagaaa tccgcacaag agatttctta 72 0 

agtaaagtat tataa 735 

SeqID 33 

atgaattata aagagattta tcaagagtgg ttagaaaacg actcactcgg taaagatatt 60 

aagtcagatt tagaagctat taaaggcgat gaatctgaaa ttcaggatcg tttttacaaa 12 0 

acattagaat ttggaacggc gggattgaga ggtaaacttg gagcaggaac caatcgtatg 18 0 

aatacttata tggtggggaa agcagcacaa gcattagcta atacgattat tgatcatggc 24 0 

cctgaagcta ttgcacgtgg aattgcagtt agttatgatg tccgttatca atctaaggaa 300 

tttgcagaat taacttgttc cattatggca gcaaatggta fctaagtctta tatttataaa 3 60 

gggattcgcc caacaccaat gtgctcatat gctattcgtg ctctaggatg tgtttcgggt 420 

gtgatggfcta ctgctagtca taatcctcaa gcttataatg gttataaggc atattggaaa 48 0 

gaaggatctc agattttaga tgatattgct gatcaaattg ccaatcatat ggatgctata 54 0 

accgattatc agcaaattag gcaaataccc tttgaagagg ctctggcaag tggtttggca 600 

agttatafctg atgagagtat tgaagaagca tataaaaaag aagtgcttgg tttaaccatt 660 

aatgatacta acattgataa gtcagtccga gtagtttata ctcctttaaa tggcgtagga 72 0 

aatttacctg tgcgcgaagt tttaagacgc cgtggttttg aaaatgttta tgtggtacct 780 

gagcaggaaa tgcccgatcc tgattttaca acggttggct atccaaaccc tgaagttcct 84 0 

aaagcatttg cctattcaga atctctagga aagtcagttg atgcagatat cttacttgcc 90 0 

acagatccag attgtgaccg agtagcattg gaagtcaaag atagtaaggg agaatatatt 960 

ttcttaaatg gtaataagat aggggcactt ctttcctatt atattttttc acaacgatgt 1020 

gccttaggga atttgccaca tcatcctgta ttggtaaaat ccattgtaac tggtgatcta 1080 

tcaaaagtta ttgcagataa atataatatt gaaactgttg aaactttaac cggatttaaa 114 0 

aatatttgtg gaaaagctaa tgaatatgat atctcaaagg ataaaactta tctctttggc 120 0 

tatgaagaaa gtattggttt ttgctatggc acttttgtac gtgataaaga tgctgtgagt 1260 

gcttcaatga tggtagtaga aatgactgcc tattataaag aacgagggca aacactttta 13 2 0 

gacgttttgc aaaccattta cgatgaattt ggctattaca acgagcgcca attttctctt 13 8 0 

gagttagagg gtgctgaggg gcaagaacgt attagtcgta ttatggagga ttttagacag 1440 

gacccaatat tacaagtagg tgagatgaga ttggagaatt ctattgattt caaggatggt 1500 

tataaggatt ttccaaagca aaattgttta aaatattatt ttaatgaggg ttcatggtat 1560 

gctttaaggc cgtcagggac ggaacctaag ataaaatgtt acctttatac gattggttgt 1620 

acagaagcag afcagtttatc gaaacttaat gcaattgagt cggcttgtcg tgctaaaatg 1680 

aatagtacta aataa 1695 

SeqID 3 4 

atgtatagag aaattaccgc tgtcgaacac gatcgctttg tgagcgaatc caaccaaaca 60 

aacctacttc aatcttctaa ttggcccaaa gtaaaagaca actggggtag tcaattactt 120 

ggcttttttg acggtgaaac ccaaattgcc agcgctagta ttctcatcaa atcacttcct 180 

cttggcttct ccatgctata tattccgcgt ggaccaatca tggattactc caatctagat 240 

attgtaacta aggtccttaa ggaccttaaa gcttttggaa aaaaacaaag agctctcttt 3 00 

atcaagtgtg atcctctcat ctatttaaaa atggtcaatg ccaaagattt tgaaaattcg 3 60 

cccgatgaaa aagaagggtt aatcgccatt gatcatttac agcgtgcagg tgctgattgg 420 

actggtcgga caacagattt agctcatact atccaaccac gttttcaggc aaacctatat 4 80 

gctaatcaat ttggacttga taaaatgtct aaaaaaactc gtcaagctat tcgaacttcc 540 

aaaaataagg gagtagatat acaatttggt agccatgaac tacttgaaga ttttgcagag 600 

ttgatgaaaa aaacggaaga ccgtaaaggg ataaatctta gaggaattga ctactaccaa 660 

aaactccttg atacgtatcc taataactca tacatcacaa tggcttcgct agatgttgca 720 

aaacgattag aaaaaataga aaaagaatgt cagatagccc aatctgaaag aataaaatca 780 

cttgaactta atcgtgagaa aaaagttaaa caacaccaag gtacgattga tcgattaaat 84 0 

aaagaaattg attttctcaa agaagctcaa aaagcctatg accgagacat tattccattg 900 
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gctgcaaccc tcactctaga atttggtaat acatcagaaa atatttatgc tggtatggac 96 0 

gattactfcta aatcttattc tgctcctatt tacacttggt ttgaaacagc tcaaagggct 102 0 

tttgaacgtg gaaatatttg gcaaaatatg ggggggattg agaatgactt atctggtggt 108 0 

ttatatcatt ttaagtctaa gttcgaacca atcattgagg aatttafctgg agaatttaac 1140 

attccagtga acagattgct ttataaagcc tctaattatg tctatgcctt acgcaaaaaa 120 0 

cgtaatagct ag 1212 

SeqID 35 

atggcttgta caacaatatt ggttggtaaa aaggcttctt atgatggttc gactatgatc 6 0 

gctagaacgg aagactctgt taatggcgat ttcacaccca aaaaattaaa ggtaatgaca 12 0 
tctaaagatc aaccgcgtca ttacaaatca gttttatcaa attttgaagt agatttacca 18 0 
gataacccac ttccttatac ttcagtaccg gacgcattgg gaaaagatgg tatatggggt 240 
gaagccggta ttaacagtaa aaatgtagcg atgagtgcta cagaaactat tacaacgaat 3 00 
tcccgcgttt tgggtgcaga tcctttggtt tcagatggta taggggaaga ggatatactc 3 60 
actttagtgc ttccctatat tcagtcagcg cgagaaggtg tggagcgttt aggtgctatt 420 
ttggaaaaat atggaaccta tgaatcaaat ggtattgctt tttcagatac cgaagaaata 48 0 
tggtggttag aaacaattgg tgggcatcat tggattgctc gtcgcgtacc tgatgatgtt 54 0 
tatgttacta atcctaacca actaggaatt gatcattttg aatttaataa ctgtgatgac 60 0 
tacatgtgct ctagtgattt gaaagagttt atcgaacaat accatttaga tttgacctat 660 
tctaatgagc atttcaatcc tcgatatgct tttggtagcc aacgtgataa agatcgtcat 720 
tacaacacac caagaagttg ggcaatgcag cgttttttaa atcctgaaat tgaacaggat 780 
ccacgtagct tgtttattcc ctggtgtcaa aagccttacc gaaaaattac tgttgaggat 84 0 
attaaatatg tgttgagtga tcattatcaa gacagtgtgt atgacccata tggaccagaa 90 0 
9*9*ggatgcgg taagtaggag agcttttcgt tcagttggta tcaaccgaac tagtcaaacg 960 

tctattctac aattacgacc aaataaatca cttgaaacga caggtgttca atggttatct 1020 

tatggctcta tgccatttgc aaccatggtg ccgttgttta cacaagttga gactgtacca 108 0 

aactattttt cgaatacaac caaggatgct tcaacagata atttttattg gaccaatcgt 1140 

ttaattgcag ctctagcaga tccacacttt tatcaacatg aagctgatat tgaaagctat 120 0 

atcgagagaa cgatggctca aggacatgca catattaacg gtgttgatag agaagttgct 126 0 

gagaataaag agattgattt tcaacagaaa aatcaagaaa tgagtgacta tatccaaaaa 132 0 

gaaagccaag aattgttaaa tcgtattcta tttgatgcaa gtaatttaat gacaaatcgc 13 8 0 

ttttcaatgg gagattaa 1398 

SeqID 36 

atgagaaaga aatttctttt attgatgagc tttgtagcta tgtttgcagc ttggcaactt 60 
gttcaagtta aacaagtttg ggctgatagt aaacttaaag tggtaacaac tttttaccca 12 0 
gtttatgagt ttacaaaaaa tgtcgttggt gataaagctg atgtatctat gttaattaaa 18 0 
gcaggtacag aaccgcatga ttttgaacca tcaactaaaa acatcgctgc catccaagat 240 
tcaaatgctt ttgtttacat ggatgataac atggaaactt gggctccaaa agtagctaag 3 00 
tcagttaaat ccaaaaaagt aacaactatt aaaggtactg gcgatatgtt acttactaaa 3 60 
ggcgtcgaag aagaaggtga agaacatgaa ggacatggtc atgaagggca tcatcatgaa 42 0 
cttgacccac acgtatggtt gtctccagaa cgtgcgattt ctgttgtaga aaacatccgt 480 
aataaatttg tcaaagctta tccaaaagat gcagcttcat ttaacaaaaa tgcagatgct 54 0 
tacattgcaa aattaaaaga gcttgacaaa gaatacaaaa atggtttgtc aaatgctaaa 60 0 
caaaagagtt ttgtgactca acacgcagcg tttggttaca tggcgcttga ttacggttta 660 
aatcaagttc caattgctgg tcttactcca gatgcagaac cttcatcaaa acgtttaggc 72 0 
gaattagcta aatacatcaa gaaatataac atcaactaca tttattttga agaaaatgct 78 0 
tcaaataaag ttgctaaaac tttagcagat gaagttggcg tgaaaacagc tgtgcttagt 84 0 
ccacttgaag gactttctaa aaaagaaatg gcagctggcg aagattactt ctcagttatg 900 
agacgtaatt tgaaagttct taaaaagaca acagatgttg caggtaaaga agtagctcct 960 

gaagaagata aaactaaaac agttgaaaca ggttacttta aaactaaaga tgttaaagac 1020 

cgtaaattga cagattactc tggtaattgg caatcagtat atcctcttct tcaagatggg 1080 

acacttgatc cagtttggga ttacaaagct aaatctaaaa aagatatgac tgctgcagag 1140 

tacaaaaaat attatacagc aggttacaag actgacgtag aatcaatcaa gattgatggt 1200 

aaaaaacatc aaatgacctt tgtacgtaat ggtaaatcac aaacatttac atacaaatat 1260 

gcaggttaca aaatcttaac ttataaaaaa ggtaatcgtg gagtacgtta tctctttgaa 13 20 

gctaaagaaa aagatgctgg tcaattcaaa tatatccaat ttagtgacca tggtattaaa 13 80 

ccgaataaag ctgaacactt ccatatcttc tggggttcag aaagccaaga aaaattattt 1440 

gaggaaatgg aaaactggcc aacatacttc ccagctaaaa tgtctggacg tgaagttgcc 1500 

caagacctta tgtctcatta a 1521 

SeqID 3 7 

atgaaattat cgaagaagtt attgttttcg gctgctgttt taacaatggt ggcggggtca 60 
actgttgaac cagtagctca gtttgcgact ggaatgagta ttgtaagagc tgcagaagtg 120 
tcacaagaac gcccagcgaa aacaacagta aatatctata aattacaagc tgatagttat 180 
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aaatcggaaa ttacttctaa tggtggtatc gagaataaag acggcgaagt aatatctaac 24 0 

tatgctaaac ttggtgacaa tgtaaaaggt ttgcaaggtg tacagtttaa acgttataaa 3 00 

gtcaagacgg atatttctgt tgatgaattg aaaaaattga caacagttga agcagcagat 3 60 

gcaaaagttg gaacgattct tgaagaaggt gtcagtctac ctcaaaaaac taatgctcaa 42 0 

ggtttggtcg tcgatgctct ggattcaaaa agtaatgtga gatacttgta tgtagaagat 48 0 

ttaaagaatt caccttcaaa cattaccaaa gcttatgctg taccgtttgt gttggaatta 540 

ccagttgcta actctacagg tacaggtttc ctttctgaaa ttaatattta ccctaaaaac 60 0 

gttgtaactg atgaaccaaa aacagataaa gatgttaaaa aattaggtca ggacgatgca 660 

ggttatacga ttggtgaaga attcaaatgg ttcttgaaat ctacaatccc tgccaattta 72 0 

ggtgactatg aaaaatttga aattactgat aaatttgcag atggcttgac ttataaatct 78 0 

gttggaaaaa tcaagattgg ttcgaaaaca ctgaatagag atgagcacta cactattgat 84 0 

gaaccaacag ttgataacca aaatacatta aaaattacgt ttaaaccaga gaaatttaaa 90 0 

gaaattgctg agctacttaa aggaatgacc cttgttaaaa atcaagatgc tcttgataaa 960 

gctactgcaa atacagatga tgcggcattt ttggaaattc cagttgcatc aactattaat 102 0 

gaaaaagcag ttttaggaaa agcaattgaa aatacttttg aacttcaata tgaccatact 10 8 0 

cctgataaag ctgacaatcc aaaaccatct aatcctccaa gaaaaccaga agttcatact 114 0 

ggtgggaaac gatttgtaaa gaaagactca acagaaacac aaacactagg tggtgctgag 12 0 0 

tttgatttgt tggcttctga tgggacagca gtaaaatgga cagatgctct tattaaagcg 12 60 

aatactaata aaaactatat tgctggagaa gctgttactg ggcaaccaat caaattgaaa 13 2 0 

tcacatacag acggtacgtt tgagattaaa ggtttggctt atgcagttga tgcgaatgca 13 8 0 

gagggtacag cagtaacfcta caaattaaaa gaaacaaaag caccagaagg ttatgtaatc 144 0 

cctgataaag aaatcgagtt tacagtatca caaacatctt ataatacaaa accaactgac 15 00 

atcacggttg atagtgctga tgcaacacct gatacaatta aaaacaacaa acgtccttca 156 0 

atccctaata ctggtggtat tggtacggct atctttgtcg ctatcggtgc tgcggtgatg 162 0 

gcttttgctg ttaaggggat gaagcgtcgt acaaaagata actaa 1665 



SeqID 3 8 

atgaaaaaga gacaaaaaat atggagaggg ttatcagtta ctttactaat cctgtcccaa 6 0 

attccatttg gtatattggt acaaggtgaa acccaagata ccaatcaagc acttggaaaa 120 

gtaattgtta aaaaaacggg agacaatgct acaccattag gcaaagcgac ttttgtgtta 180 

aaaaatgaca atgataagtc agaaacaagt cacgaaacgg tagagggttc tggagaagca 24 0 

acctttgaaa acataaaacc tggagactac acattaagag aagaaacagc accaattggt 3 00 

tataaaaaaa ctgataaaac ctggaaagtt aaagttgcag afcaacggagc aacaataatc 3 60 

gagggtatgg atgcagataa agcagagaaa cgaaaagaag ttttgaatgc ccaatatcca 42 0 

aaatcagcta tttatgagga tacaaaagaa aattacccat tagttaatgt agagggttcc 480 

aaagttggtg aacaatacaa agcattgaat ccaataaatg gaaaagatgg tcgaagagag 54 0 

attgctgaag gttggttatc aaaaaaaatt acaggggtca atgatctcga taagaataaa 60 0 

tataaaattg aattaactgt tgagggtaaa accactgttg aaacgaaaga acttaatcaa 6 60 

ccactagatg tcgttgtgct attagataat tcaaatagta tgaataatga aagagccaat 72 0 

aattctcaaa gagcattaaa agctggggaa gcagttgaaa agctgattga taaaattaca 780 

tcaaataaag acaatagagt agctcttgtg acatatgcct caaccatttt tgatggtact 84 0 

gaagcgaccg tatcaaaggg agttgccgat caaaatggta aagcgctgaa tgatagtgta 900 

tcatgggatt atcataaaac tacttttaca gcaactacac ataattacag ttatttaaat 96 0 

ttaacaaatg atgctaacga agttaatatt ctaaagtcaa gaattccaaa ggaagcggag 102 0 

catataaatg gggatcgcac gctctatcaa tttggtgcga catttactca aaaagctcta 108 0 

atgaaagcaa atgaaatttt agagacacaa agttctaatg ctagaaaaaa acttattttt 114 0 

cacgtaactg atggtgtccc tacgatgtct tatgccataa attttaatcc ttatatatca 12 0 0 

acatcttacc aaaaccagtt taattctttt ttaaataaaa taccagatag aagtggtatt 126 0 

ctccaagagg attttataat caatggtgat gattatcaaa tagtaaaagg agatggagag 13 2 0 

agttttaaac tgttttcgga tagaaaagtt cctgttactg gaggaacgac acaagcagct 138 0 

tatcgagtac cgcaaaatca actctctgta atgagtaatg agggatatgc aattaatagt 144 0 

ggafcatattt atctctattg gagagattac aactgggtct atccatttga tcctaagaca 1500 

aagaaagttt ctgcaacgaa acaaatcaaa actcatggtg agccaacaac attatacttt 156 0 

aatggaaata taagacctaa aggttatgac atttttactg ttgggattgg tgtaaacgga 162 0 

gatcctggtg caactcctct tgaagctgag aaatttatgc aatcaatatc aagtaaaaca 168 0 

gaaaattata ctaatgttga tgatacaaat aaaatttatg atgagctaaa taaatacttt 174 0 

aaaacaattg ttgaggaaaa acattctatt gttgatggaa atgtgactga tcctatggga 18 00 

gagatgattg aattccaatt aaaaaatggt caaagtttta cacatgatga ttacgttttg 186 0 

gttggaaatg atggcagtca attaaaaaat ggtgtggctc ttggtggacc aaacagtgat 192 0 

9"ra9"ga-attt taaaagatgt tacagtgact tatgataaga catctcaaac catcaaaatc 198 0 

aatcatttga acttaggaag tggacaaaaa gtagttctta cctatgatgt acgtttaaaa 204 0 

gataactata taagtaacaa attttacaat acaaataatc gtacaacgct aagtccgaag 2100 

agtgaaaaag aaccaaatac tattcgtgat ttcccaattc ccaaaattcg tgatgttcgt 216 0 

gagtttccgg tactaaccat cagtaatcag aagaaaatgg gtgaggttga atttattaaa 222 0 

gttaataaag acaaacattc agaatcgctt ttgggagcta agtttcaact tcagatagaa 228 0 
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aaagattttt ctgggtataa gcaatttgtt ccagagggaa gtgatgttac aacaaagaat 2340 

gatggtaaaa tttattttaa agcacttcaa gatggtaact ataaattata tgaaatttca 240 0 

agtccagatg gctatataga ggttaaaacg aaacctgttg tgacatttac aattcaaaat 2460 

ggagaagtta cgaacctgaa agcagatcca aatgctaata aaaatcaaat cgggtatctt 252 0 

gaaggaaatg gtaaacatct tattaccaac actcccaaac gcccaccagg tgtttttcct 25 8 0 

aaaacagggg gaattggtac aattgtctat atattagttg gttctacttt tatgatactt 264 0 

accatttgtt ctttccgtcg taaacaattg taa 2673 



SeqID 39 

ggagcaaaaa aggcagggtg gactgaatac gctaggatgt tagaagtcag agagcaggtt 60 

gaccatgtga tgattccaaa aatcaatcag gatttaccaa tctacgctgg tccagaagag 12 0 

gacaatctgc aacggggagt tggtcatcta gaagggataa gtttgccgat tggaggggct 18 0 

tctacacatg cggtcttgag cggfccaaaga ggtatgccag ctgctcggtt gtttgcggat 24 0 

ttggataaga tgaaaaaagg tgattatttt tatgttacca atctgaaaga aaccttggct 3 00 

tatcaagtgg atcgtatcat ggtgattgaa cctagccaat tggatgccgt gagcattgaa 3 60 

gaggataaag attatgttac ccttctgacc tgtacacctt atatgggctc tttgtcaact 42 0 

gtaatgggtg acttatcatt aacaacgaga gagaatcagc ttggttctct ctctttttgg 48 0 

atgttcaaag cgatgagaat tttgctttta aaatttttaa agttgcgaaa gccaaaggct 540 

tgtcgcttga tgtctttgat aagcttgfctg gtcgcttcta atttggcatt ggaaagggat 6 00 

agctga 606 



SeqID 40 

atgtcggatg ttgtcgaaaa acaaacagct aaatcgttta taatgaacgt attaaatggt 60 

ttagctttag ggactgttat tgtcttgata ccaggagcaa tccttggaga attaatgaaa 120 

gctttactgc caatgtggag cggatttgca actttaatcg ctgcaacagc agttgcaaca 180 

agtatgatgg ggcttgttat tggtattatg gttggtctca attttaaatt taatcccatt 240 

caatcagcat cactaggatt agcggttatg tttgcaggtg gagctgcaac cttccttaaa 3 00 

ggcgctatta tgctaaaagg aacaggcgat atcattaata tgggaattac tgctgcctta 3 60 

ggtgtccttt tgatccaatt tttatctgac aagacgaaat cattcacttt gattgtcatt 420 

ccaacagtaa cactcttact agttggagga gtaggtcatg tgttattacc ttatgtcaaa 480 

atgattacca ccatgattgg acaagggatt gcttcattac ttggtttaca acctgttttg 540 

atgtcaattt taattgcgat gattttctgt ttcttgatcg tatctccaat cacaacagtt 600 

gggattgctt tggcaattag tttgtccgga attggttctg gtgctgctaa tttaggtatc 660 

tgtgctgcta gttttgggct ttgtatggct gggtggtcag tcaattctaa ggggacagcc 72 0 

ttagcgcatg ttttagggtc tcctaagatt tcgatggcta atgtattagc aaaaccgaag 780 

attatgctac ctatgatatc ttcagcagca atactaggaa ttctcggagc tctctttaat 840 

attcaaggaa caccagctag tgcaggtttt ggtatcagtg gtttgattgg accgattaat 9 00 

gctcttaatt tagcaaaagg tggatggtca gttatgaata tgttattgat aattataata 960 

tttgttgctg caccaattat attgaatttt atttttaatt acctctttat aaaagtactt 1020 

aaaattattg atccaatgga ctataaatta gatatttaa 1059 



SeqID 41 

atggctagac ccttagtaga acaaacagca gatcgtctat tgcatttaat tttagaaaga 60 

gaataccctg ttggtgctaa gctccccaat gaatatgaat tagcggaaga tctagatgtt 120 

ggtcgcagca caatccgaga agcagtcaga agtttagcaa cgcgtaatat tttagaagtc 180 

agacagggct ctggaacata tattagttct aaaaagggtg tttctgagga tccattaggg 240 

ttctctctaa taaaggacac agataggttg acatcagatc tctttgaact acgcctttta 3 00 

ttggaacctc gaatagcgga gctggtagct tatagaatta cagatgatca gttacaatta 3 60 

ctcgaaaagc ttgttggaga tattgaagat gctgttcacg caggcgatcc gaaacattta 42 0 

cttttagatg tggaatttca ttccatgcta gctaaataca gtggtaatat tgcaatggat 4 80 

agccttttgc cagtcatcaa ccagtcaatt catctgataa atgcgaatta tacaaatcgt 540 

cagatgaagt cggatagttt agaagcacat agagaaatta tcaaagcaat ccgagaaaaa 600 

aatccagtag ctgcacatga tgctatgctt atgcatatca tgagtgttag acgttcagca 660 

ttgaaagctt aa 672 



SeqID 42 

atgattaaga 

ggtggtgtat 

gcttataaag 

tttgaaaact 

tatcatgtag 

tcaaaatcga 

gaagaagcca 

cttaagatca 

ctgaaaaata 



aaaataaagt 
tattttacca 
atgctaaagt 
ttagagggag 
atgctaagac 
cttctcagtc 
aaaagattgc 
aagaagaaat 
aaaatgaagt 



ttttttagga 
atcacaattc 
ggctaagaag 
ctacgaaatt 
tggacaaatc 
aagttcatcg 
atttaaagat 
tgaaaatgga 
tgattatcaa 



gttcttttag 
caaaagacaa 
gatgtcatac 
gagttcaata 
cttgagcggg 
tcaagtcaaa 
gctaatatag 
aaatcagttt 
attgatgcag 



tccttgttgt 
ctaatcaagc 
atcaaaaaat 
ccaaaagcgc 
acatggataa 
aatctcataa 
aggaatcaga 
atgatattga 
aaactggaaa 



tatccttgga 
gttagctatt 
tgacaaagaa 
agaatacagc 
taatggcttt 
aataagtcaa 
agtcagtaat 
ctttgtggac 
aatcattgaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
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cgttctagag atcatatgaa tgattaa 5 67 
SeqID 43 

ttgaatcgta aaaaaaggta tcgtctcact gtaaaaaagc aaaatgctag catacctaga 60 

cgattaaatc ttctattttt tattattgtt ttattattta ctgttctaat tcttcgttta 120 

gagcaaatgc agattgggca acaatcattt tacatgaaaa aactgacagc tcttacgagt 180 

tatacagtga aagaatcaaa agcgagaggt cagatttttg atgctaaggg agtagtctta 240 

gttgaaaatg atgaacgtcc tactgtggcg ttttctagag gaaacaatat ttcatctcag 3 00 

tctatcaaag agttagcgaa taaattatca cattacatta ccttaaccga agttgcttca 3 60 

agcgatagag caaagagaga ttattattta gcagataaag ccaactacaa aaaagttgtt 420 

gaaagtttac cagattctaa acgttacgat aaatttggta accatctagc cgaatcaacc 48 0 

gtatatgcaa atgcagttgc tgcagttcca gtgagcgcta ttaactattc tgaagatgag 540 

ttaaaagtgg ttgccttatt taaccaaatg aatgcgacac caacgtttgg ttcagtgaaa 600 

ttaagtacag gcgaattaag tgatgatcaa atcaagaaac tggatgctga taagaaggaa 660 

ctgttaggaa tttctgtaac aagtaattgg catcgtcgta aaaaaggaac ttctttatcg .720 

gatatattag gtactatttc tacagaaaag gcaggtttac caagagaaga agttaaaaaa 780 

tatttgaaaa aaggctattc actgaatgac cgagtgggaa cttcttacct tgaaaaacaa 840 

tatgaagatg acttacaagg aatcagacaa attcgcaaag tagttgttaa taaaaaaggt 9 00 

aaagttgttt cagataatat tacgcaagaa ggtaagtcag gaagaaatct taaattaacg 960 

attgatctta attaccaaaa caaagtagaa agtatcttga aacaatacta tggcagtgaa 102 0 

ttatctagtg gacgcgctag tttttccgaa gggatgtatg cagttgctat agaaccttca 10 80 

actggtaaag tgttagcaat ggctggtctg aaaaatgatc atgggaattt agttgatgac 1140 

agcctaggca ctattgctaa gaatttcacg ccaggatctg ttgtgaaagg tgcgacgcta 12 00 

tcatcaggat gggaaaataa agttcttaga ggaaatgaag tgctttatga ccaagaaatt 12 60 

gctaatatac ggtcatggtt tacacgaggt ttgactccaa tatctgcggc acaagcgcta 1320 

gagtattcat ctaatactta tatggttcaa gtagcacttc gtttaatggg acaagactat 13 80 

aatacaggag atgctttaac tgatcgaggc tatcaagaag caatggctaa actaaggaaa 1440 

acttacggcg aatatggttt aggggtttct acaggattag atttacctga atcagaaggt 1500 

tatgtacctg gaaaatacag cttaggaaca actctaatgg aatcgttcgg tcagtatgat 1560 

gcctatacac caatgcaact tggtcagtat atctcaacta ttgcgaataa tgggaatcgt 1620 

ttagcacctc acgtggtttc agatatctat gaagggaatg attctaataa gttcgctcaa 1680 

ttggttcgtt caatcactcc taaaacacta aataagatag ctatctcaga tcaagagtta 1740 

gccattattc aagaaggttt ttataacgtt gtcaatagtg gaagtggcta tgcaactggt 18 00 

acgtcaatga gggggaatgt gacaaccatt agtggtaaaa ctggtaccgc tgaaacattt 18 60 

gctaaaaatg taaatggaca aacagtttct acctacaact taaacgctat tgcctacgat 192 0 

actaatcgta aaatagcagt agcggtaatg tatccgcatg ttacaactga tacaacaaaa 1980 

tcccatcaat tagttgcacg agatatgatt gatcaatata tttcacagtt cacaggacaa 2 040 

taa 2 043 

SeqID 44 

atgacggttt ttcctaaaca ttttctatgg ggaggtgcag ttgctgccaa tcaggtagag 60 

ggtgcattta gaacagatgg caaaggttta tctgtacaag atgtacttcc aaatggtggt 12 0 

ttaggcgatt tcaccgccaa acctactcca gacaatttga agctagaagc aattgatttt 180 

tatcataact ataaaaatga tataaaactg tttgcagaga tgggctttaa agtttttaga 24 0 

acctctattg cctggtctcg aatttttcca aatggtgatg atagtgctcc aaacgaagct 3 00 

ggcttacaat tttatgataa tttatttgat gagttgctta aatataatat cgagcccttg 3 60 

gttaccttat ctcattatga aacaccgctt catcttgcaa aaacttataa tggttgggct 42 0 

gatagacgtt tgatagcatt ttttgagaaa tttgctcaaa cagtcatgga gcgttataaa 48 0 

gataaggtga aatattggtt gacttttaat gaagtgaatt ctattttaca tatgcctttc 54 0 

accagtggtg ctattatgac cgacaaatca caactaagtc ctcaggagct ttatcaagct 60 0 

attcatcatg aattagtagc atcggctaga gtgacaaaat tagggcgttc tatcaatcct 660 

aattttaaaa ttggctgtat gattctggct atgccagctt atcctatgac atctgatcca 72 0 

agagatgtcc tggctgcaag acaatttgaa caacataatc tgctattttc agatatacat 78 0 

gtcagaggta aatatcccac ttatattcag tcctatttca aaaataatgg tattaaaatt 84 0 

aaatttgaag aaggagatga ggaagtattg gcacaaaata cagtagattt tctatcgttt 90 0 

agttactata tgagcgtgac acaagcttat gactttgaaa attatcaaag tggtcaggga 96 0 

aatatccttg gaggcctaac taatcctcat ttaacaactt cagaatgggg atggcaaatc 102 0 

gatcccatcg ggctacgcct agtcttaaat cagtattatg aacgttatca gattccgttg 108 0 

tttattgtag aaaatggatt gggagcaaaa gatcaattga tagaaacact agacggtgat 114 0 

tatactgtcg aagatgatta tcgtattgat tatatgaatc aacatttagt tcaagttgct 120 0 

aaagctattg aagatggtgt tgaaataatg ggatatacat cttgggggtg tattgattgt 12 60 

gtatccatgt ctacagctca actaagtaaa cgttatggtc taatatatgt tgatcgaaat 132 0 

gatgatggaa cagggagtct acaacgttac aagaagaagt catttggttg gtaccaaaaa 13 80 

gtgattaaga caaatggcca atcattgttt gaacatcata acagataa ~ 1428 
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SeqID 45 

atggctactt 

gaccagactg 

tttaaagtct 

tttttaccaa 

ttctcatttt 

ttttgggata 

cagtggtttc 

ttgacaattt 

aactaa 



ttcaaattaa 
ggcttccctg 
ttgacgaaga 
agtttaatgt 
tgaaacctca 
tagatttcca 
gtatgacttc 
cgcttgtcat 



agaaaaaatg 
ttatcacgtt 

ggggcatctg 

gacattagcc 
ttacaccatt 
actcttaaaa 
aacttatcag 
agcgatcgac 



ttttctctag 
gaaggaagtt 
attagtcaga 
aatgggaatc 
gaagaccttg 
gacaatcaag 
gtagaagttt 
tatgtcaaag 



gcggtaaatt 
tgttccctct 
ttgagaagaa 
atttcacgat 
atatggaagt 
ttatagcaaa 
acaatgaaac 
agttagaaaa 



tactattaca 
acctaagact 
ggttttaagt 
taagaaagat 
taagggaaat 
catatctcaa 
ttacaacgat 
aaatgcatca 



60 
120 
180 
240 
300 
360 
420 
480 
486 



SeqID 46 

atgaaaaaaa 

gttaaagctg 

aaagcatttt 

tttaataatt 

atctacaatt 

aatgcctatt 

cgtaactatg 

gcatcgcatc 

gaacatgttt 

aattag 



taacaacttt 
atgataactt 
atcaaagact 
cagaaataag 
taggcattct 
ctttggaaaa 
atttatttga 
aagataatac 
atctcaatga 



aatcttagct 
tgaaatgcca 
acaagaaaaa 
gcatcaacta 
tatttctcag 
tgtctcagat 
aaaaatcaaa 
agacttaaaa 
aagtagagtg 



agtagcttat 
acgcgttatg 
caacgtaagg 
cctcttaaac 
gagtctaaag 
attgttaatg 
tctacacgtg 
aacttttatg 
gaggcgataa 



tactagttgc 
ttaaaatgag 
cacatactac 
aagaaaaggc 
ggttcatcca 
aagctcaggc 
ataaggttca 
ctgagttaga 
acagaaatat 



aacgacatcg 
tgaaaaatca 
tgtgaagact 
tagaaatgat 
acgtattgat 
tttgtataaa 
agtcttactt 
tgatatgtat 
ccaaaaatat 



60 
120 
180 
240 
300 
360 
420 
480 
540 
546 



SeqID 47 

atgaaattaa 

gtttttatgc 

tatctattag 

catattaatg 

gaacctgcct 

ttcgcttcat 

attaataaag 

gtggggactg 

tcaaaaccag 

tggcttttaa 

ttatacttta 

ttattttcaa 

agggttgaga 

ctatggttta 

caagaaattg 

aaatccaaag 



aaaaattctt 
gtcattttca 
tgacagcttt 
tgtcggattt 
cacgcttagc 
taacagcttt 
catacggtgt 
ggctgatcat 
ttttacagat 
atcttgcaca 
ttttaccfcaa 
cgtttgttat 
aaatggtaga 
ttttcttagc 
cgacaggaaa 
aagaaaaata 



tgaggatttg 
aagtgcagag 
tccattacta 
attgtcactt 
agtagatgct 
ttggacaatg 
tgaccaacac 
cctattttta 
aattgtaaac 
accagttaca 
cgcacgaatt 
tggttttttt 
tattaaaacg 
ccacattatg 
aattgaatca 



ctggctaaac 
atggacttat 
gttattgctg 
atgcaaaaaa 
ttttcgaaac 
tctaaaagtt 
agagactttg 
ctgacttttg 
atgtatgatt 
tttttgacta 
agaaaagtac 
agtaatttaa 
tttggttcgg 
attttaggtg 
aggcgtggag 



tagaatatag 
cagccatagc 
ctaatatttt: 
atcttccgaa 
cgtctaccgg 
tgacctctcfc 
t gat tag teg 
ttttaatttt 
tgggagatac 
tatttttagg 
gttatgtcat 
ttagecaata 
tegttatett 
caatactgaa 
atattatgtc 



acccattcaa 
agttgcttat 
tccttatttt 
aaatatttat 
cattttggga 
teaaaaagee 
tttagtgggt 
ttcaactttt 
tetaaeggeg 
tattggaatt 
tccagggaca 
cgtcttaaat 
tatcttaatg 
tgccagcgtt 
tcttatccaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
921 



SeqID 48 

atgaaattac 

gcgcatgctg 

gaaaaagagc 

actegctttg 

ttaaacgata 

caattaaaag 

ctatacaaag 

cctgaaaaat 

ggagaatacg 

ctagatgtgg 

gaagaagaag 

tatgetagee 

gttgattttt 

aagtgggaaa 

tttacaaata 

tgeattaate 

cagggatatc 

caattaagca 

atggctgcgc 

tttagagctg 

gatgatttag 

ttctttacag 

cgtcagtccc 



tttatactga 
ctgaagctgg 
gtgeagtatt 
ctcaattggc 
ttggattagc 
tttttggacg 
aattgeagae 
ttgaagattt 
acaatcaatc 
atttaaagaa 
ctttaattaa 
aaaaagcgta 
taagatatct 
gtaagtctgg 
gtagtcacat 
aaaaagatga 
gctacaaaga 
agatttttga 
accctttagt 
aagatgtttt 
attattttga 
attttgttgt 
tccttgctcc 



tatfcaatcat 
atggcgtatt 
agagaacctt 
acgttatttt 
aatgatcttt 
attgaagcaa 
agcaaatctg 
attagctatt 
taaaattgee 
cactatcctt 
aagtctgagt 
taaggctaac 
agcaacgaca 
ttttgaaatg 
tttagacgac 
agtggaacat 
tattttagtt 
gcagtatgat 
tcattttatg 
aaatcttttt 
ggcttatatc 

gggggctaaa 

actggaaagt 



gatatgacag 
ttttatattg 
cctcaagaag 
actttaaacc 
tategtgett 
gatgettect 
agtattttag 
tttctagtgg 
ttttttacag 
atagtcgatg 
tcacgttgtc 
tttactaatg 
tttcaaacaa 
atttcgaaaa 
actgetaaag 
gtggctagag 
ttactaggag 
attccttatt 
gattccttgt 
aagacaggta 
teatatgeag 
aaatttgatt 
tttgttaaaa 



aaatcttagt 
ctccaaactc 
cctcttttgc 
agectaatea 
tagcaagttt 
ttatcagtca 
atttaaaata 
tttcagatct 
aacaagtaag 
gttttacccg 
aagagattat 
gaaatattta 
aaccagaatt 
atatcgaagg 
attgtataac 
etataegtea 
atgtagatag 
attteggtaa 
ctcgaataaa 
tttatggaga 
acattaaagg 
tagggegtet 
cgaaaaaaca 



caatcaggcc 
tttatctttt 
tattacaatt 
aaaagaaagt 
tgaagatgga 
attggtagac 
tttacattct 
tttaagagaa 
gtcaggccag 
tttctcagct 
categgaget 
ttctgcaggg 
tatcttgtct 
taaacatgat 
aatttgggag 
gaaactttat 
ttataaacta 
agctgaaaca 
aegttatega 
gattagtcaa 
tcctaagaag 
taacactatt 
agaeggtata 



60 
120 
180 
240 
300 
360 
420 
480 
54 0 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
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aaaacactca accaattcat gtttttcctt actcaggtag gtttgagtga caatttatcg 1440 

cgattagttg gtcaaatgtc tgaaaatgaa caagaaaaac atcaagaagt ttggaaaaca 15 00 

tttacagata ttttagagca gtttcaaaca atatttggtc aagaaaaatt aaatttagat 15 60 

gaatttttat cgttattaaa ttcaggtatg atgcaagccg agtatcgtat ggtccctgct 1620 

actgttgatg tggttactgt taagtcttac gacttggtag aaccacatag taatcaattt 1680 

gtctatgccc taggcatgac acaatcacat tttccaaaga ttgctcaaaa caaatcttta 1740 

atatctgata ttgaacgtca gcttataaat gatgccaatg acactgatgg tcattttgac 18 00 

attatgacgc gagaaaattt gaagaaaaac cattttgctg ctctctctct ttttaatgct 1860 

gctaaacaag cattagtgtt gactatacca caattattga atgagtctga agatcagatg 192 0 

tcaccatatc ttatcgaact aagagatata ggtgttcett ttaatcataa aggacgccaa 198 0 

tcgttaaaag aagaagctga taatattggg aattacaaag ctttattatc gcgtgttgtt 2 04 0 

gacttgtatc gctctgctat tgacaaagag atgactaaag aagaacaaac tttttggtca 210 0 

gttgctgttc gttatttgcg tcgtcaatta acttctaaag ggattgagat accaataatt 2160 

acagatagtc ttgatactgt gacagtttca tcagatgtga tgaccagacg cttcccagaa 222 0 

gatgacccac ttaaattatc ctcatctgct ttgacaactt tttataataa tcaatataag 2280 

tattfctttac aatatgtgtt aggtttggaa gaacaagact caattcatcc agatatgcgg 2340 

caccatggaa cgtaccttca tcgtgttttt gaaattctga tgaaaaacca agggatagaa 240 0 

tcttttgaag agaaactcaa ctctgcaatt aataaaacca atcaagaaga tgttttcaaa 2460 

tccttgtatt cagaagatgc agaaagtcgt tattctttag aaattttaga agatatcgcg 252 0 

cgtgcaactg caaccatatt acgacaagat agtcagatga ctgttgaaag tgaagaagaa 2 58 0 

cgatttgaac taatgattga taatactata aaaattaacg gtattattga ccgtatcgat 2 64 0 

cgtttatctg atggttcttt gggtgtcgtg gactataaat caagcgctca aaaattcgat 270 0 

atccaaaaat tttataacgg tctaagtcca caattagtta cctacattga tgctatcagt 2 76 0 

cgtgacaagg aggttgaaca aaaaccaccg atttttggag ctatgtattt acatatgcaa 282 0 

gaacctaaac aagacttgtc taaaattaaa aatttagatg atttagtaac aaaaaatcat 288 0 

caagctttaa cttataaagg attattttca gaagctgaaa aagaattttt agcaaatggg 294 0 

aaataccatc tcaaagattc cctttattca gaggctgaaa ttgctatttt acaagcacat 3 00 0 

aaccaattgc tttacaaaaa agcatctgag actattaagt caggtaagtt tttgatcaat 3 06 0 

ccatatacag aagatgctaa aacagttgac ggtgatcagt ttaagagtat tacggggttc 312 0 

gaggcggata ggcatatggc gcgtgccaga gcgctttata aattacctgc taaggaaaaa 318 0 

cgtcaaggtt tcttaacatt aatgcagcaa gaggaggaaa atgatgacct ttaa 3 23 4 



SeqlD 49 



atgagcgaaa 
atgcgcaaag 
ttcggtacat 
atctcagagg 
atcgttgatt 
ggtgctaaaa 
gcatcagggt 
acacacattc 
ttgaaatctt 
ggtaagaaag 
attaagcgtg 
atgcaagctg 
acacttattc 
attttggtta 
gctgaaagtg 
gttccagtac 
aaagatgcta 



ctaaagttat 
atgaaaaagt 
cagttggtat 
ctgcgattgc 
tgacattcat 
cgaattatat 
caggtattgg 
caggtctaaa 
ctattcttga 
aagaagttaa 
agggtactga 
ctgaagaagt 
cgcttgataa 
acgatgctta 
aagcatttga 
cgtattcacg 
tctacaaagt 



ggctfctgcgc gaagcgatta 
atttttaatg ggtgaagafcg 
gttagaagag tttggtgcta 
aggttcagct atcggtgcfcg 
ggatttcgta acaatcgcaa 
gtttggtgga ggtctatcaa 
ttcggctgca caacactcac 
agttgttgca cctggtactg 
caataaccca gttatcttct 
catggatcct gattfcctata 
cttaacaatt gtttcttatg 
tgctgaggaa ggtattaacg 
agaattaatt atcgattctg 
taaaactggt ggtttcactg 
ttaccttgat catcctatcg 
tgtacttgaa caaggaatct 
agttaacaaa ggtaaataa 



atgtcgctat 
taggtgtata 
agcgcgttcg 
cacaaacagg 
tggatgctat 
ctccagttac 
aatcattaga 
ttaatgagtc 
tagagccaaa 
ttccacttgg 
gccgtatgct 
ttgaggttgt 
ttaagaaaac 
gtgaaattgc 
tgcgtcttgc 
tacctgatgt 



gagcgaagaa 
tggaggagac 
cgacacacca 
tttgcgtcct 
cgttaaccaa 
tttccgtgtg 
agcttggtta 
aaaagcactt 
agctctctat 
taaaggggat 
tgagcgtgtt 
tgatccacgt 
tggtaaattg 
aactatggtt 
aagtgaagat 
agcaaaaatt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
999 



SeqlD 50 



atggcttttg 
gagtggaaaa 
tctgataaaa 
catcaggcag 
gaagaagtag 
tcagtttctg 
gaacttgcag 
gcagcaatcc 
ggtggtacgt 
atcctagatg 
gcaattgata 
ggtgtacgtg 
aaccctgata 



atgtaattat 
aaaatgaagg 
ccaatatgga 
gagatgttgt 
gtacgtcatc 
gaccfcgctgc 
ctgatgagta 
gcggtgctca 
gtttgaatgt 
gtttgaaagt 
tggataagac 
gtctcttgaa 
agtctgttgt 



gccaaaactt ggggttgaca 
tgacaccgtt aatgaaggtg 
aattgaagca gaagacactg 
tcccgtcact gaagtgattg 
accttcagct gatgcaacta 
tccttcacaa gagacagttg 
cgatatcgta gtcgtaggtg 
acttggagga aaaattgcca 
tggtfcgtatc cctacaaaaa 
agcagctgga cgtggtatca 
tgttgccttt aaaaattctg 
agcgaacaaa gttgaaattt 
tattggcgat aaagttatca 



tgcaagaagg 
atgtccttct 
gtgtactttt 
cctatatcgg 
tcacagctga 
cagcagcaac 
gtggacctgc 
ttgttgagaa 
catatcttaa 
accttgcttc 
tagttaagac 
tcaatggact 
aaggtcgtaa 



cgaaatcctt 
tgaaatcatg 
aaaaattgta 
agaagaaggt 
agatggtcaa 
tcctaaagaa 
aggttattac 
aactgaattt 
aaatgctgaa 
aactaactat 
actgactggc 
tggacaagta 
cgtcgttctt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
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gcgactggtt ctaaggtatc tcgtattaat attccaggta ttgaatcacc attagtatta 840 

acatcagatg atattcttga tttacgtgaa attcctaagt cacttgctgt tatgggtggt 9 00 

ggtgtcgtag gtattgagct tggtcttgta tgggcttcat atggtgttga cgtaactgtt 96 0 

attgaaatgg ctgatcgcat cattccagca atggataaag aagtatcgct agaacttcaa 102 0 

aaaatccttg ctaagaaggg aatgaagatt aaaacatcag ttggtgtttc tgaaattgtt 108 0 

gaagccaata atcaattaac gcttaaatta aacaacggcg aagaagttgt tgctgataaa 114 0 

gctcttcttt caattggacg tgttccacaa atgaatggtc ttgaaaatct tgagcctgaa 12 00 

cttgagatgg aacgtggacg tattaaagtt aatgcttatc aagaaacttc aattccaggt 1260 

atctatgcac caggtgatgt taatggaact cgtatgttag ctcatgctgc ataccgtatg 1320 

ggtgaagtag ctgctgaaaa cgcactcggc ggtaataagc gtaaagcaca tctagatttc 13 8 0 

actccagcag ctgttfcatac tcatcctgaa gtagcaatgg tggggatgac agaagagcaa 1440 

gcacgtgaac aatatggtga catattagtt ggtaaaaata gctttacagg caatggtcgt 1500 

gctatcgctt caaacgaagc acatggtttt gtaaaagtaa ttgctgaacc taagtataaa 1560 

gagattcttg gtgttcacat tattggtcca gctgcagcgg aattgattaa tgaagcttct 162 0 

acgattatgg aaaatgagtt gactgtctat gatgtagcac aatctatcca tggtcaccca 168 0 

acattctctg aagttatgta cgaagctttc ttagatgttc tcggtgaagc tattcacaac 174 0 

cctccaaaac gcaaataa 1758 

SeqID 51 

atgggaaaat attttggtac agatggtgtt cgcggtgaag ctaatgttga actgacgcca 6 0 

gaattggctt ttaaattagg acgttttgga ggatatgttc taagtcaaca cgaaactgat 120 

cgtccgagag tttttgtagc tcgtgatact cgtatttctg gtgaaatgct ggaatctgct 180 

ttgattgctg gtctactttc agtaggtatt gaggtttata aacttggtgt tttagccact 240 

cctggagtgt cttaccttgt ccgtacagaa aaagctagtg caggggttat gatttcagca 3 00 

agccacaatc cagcacttga taatggtatt aagttctttg gtagtgatgg ttttaaactfc 360 

gafcgatgatc gtgaattaga aattgaagct ttacttgatg cgaaggaaga cactttacca 420 

cgtccatcag cacaaggttt aggtacttta gtagattatc ctgaaggact tcggaaatat 480 

gagaagttta tggaatcaac tggtattgat ttagaaggta fcgaaggttgc gttggataca 540 

gcaaatggag cagcaacagc ctcagctcgg aatatctttt tagacttgaa tgctgatata 600 

agtgttattg gtgatcaacc agatggtttg aatatcaatg atggtgttgg ttcgacacat 660 

cctgagcaat tacaaagttt agttagagaa aatggttccg atatcggtct agcttttgat 72 0 

ggtgatagtg atcgtttgat cgctgttgat gaaaatggag agattgtaga tggtgataaa 78 0 

attatgttta tcattgggaa atatctttct gataaagggc agcttgctca gaatacaatc 84 0 

gtaacaacgg ttatgtcaaa tctcggcttt cataaagcgc tcgaccgtga aggtattcafc 900 

aaagcgatta ccgctgtagg agaccgctac gtggttgaag aaatgcgtaa atcaggatat 960 

aatcttggcg gggagcaatc aggccacgtt attatcatgg attacaatac tacaggggat 102 0 

ggacaattaa cagccatcca attgactaag gtcatgaaag aaactggtaa gaaattatca 108 0 

gagttagcta gtgaagtgac gatttatcct caaaaattag taaatattcg tgttgaaaat 1140 

aacatgaaag ataaagcgat ggaagttcct gcaattgcgg aaattattgc taaaatggaa 12 0 0 

gaagagatgg atggcaatgg tcgtattcta gtacgtccta gtggtactga gcctcttctt 12 60 

agagttatgg cagaagcgcc aacaaatgaa gcagttgatt attatgttga tactatcgca 1320 

gatgtcgttc gtacagaaat tggtttagat taa 1353 

SeqID 52 

atgaaaggtc aaaaaattat tgctctagct ggtcttgtcc tgtcatgcca ttttgctctt 60 

acggcatgtc atactcaaga acacaaaaat tctcatcata ttaaaacaaa gcaggttgct 12 0 

aagaaaaaag ctaataaaaa gaaagtctct gttaaggaaa gtcataaaaa acgcaagggt 180 

gttgcagggg ttgactttcc aactgatgat ggctttttac tgactaaaga ttcgaagatt 24 0 

ttatctcatc ccgattctgg aatcgttgta gcacatggaa atcattctca ttttattttt 300 

tactctgatt taaaaggttc aaaatttagc tatttaattc ctaatgcaaa tgcaaaaact 36 0 

aataaaaagc aagcggtaag aaactttaaa gcaggggcgg tagctgttaa tacattaaat 42 0 

gatggctatg tttttaatcc cgcagatatt gtttcagaag atgctaatgg ctacgtggtc 48 0 

agacatggtg atcatttcca ctatattcca aaagcaagtt tatctcagca aaagcaagta 54 0 

caagcaagta gagctgtttc acgtttagga aatcaaaata atagccatta cagagttaat 60 0 

agttctaaaa ttgcagggct tcaccaccca acaagtgatg gtttcttatt caatggacaa 660 

ggaataaaag gcactacccc aacaggtatt ttagtagaac atcataatca tctgcatttt 72 0 

attagttttg ctgatttaag aaaaggggga tggggatcaa ttgctgaccg ttaccaacca 78 0 

caaaagaaag ttgattctaa aaaacagtca ccatcaagta aaaagccaag aactgaaaat 84 0 

actttaccta aggacatcaa agataagcta gcttaccttg ctcgtgaatt acatttagat 900 

atttcacgta tcagagtcct taaaacacta aatggfcgaaa ttggatttga atacccacat 960 

gatgatcata cacatgttat tatggccaaa gatattgatc taagtaagcc aattccaaat 102 0 

ccacatcatg atgacgaaga tcatcacaaa ggtcatcacc atgatgagtc cgaccataag 1080 

catgaagaac acgagcacac fcaagtcaaat aaattatctg atgaagatca aaagaaatta 114 0 

atttatttag cagagaagct tggtttaaat cctaatcaaa ttgaagtatt aacgtcagag 120 0 

gatgggagta tcatctttaa atatccacat gatgaccact cacacactat agcaagtaag 1260 
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gatattgaaa ttgggaagcc tatcccagat ggacaccacg atcactctca tgcaaaagat 13 2 0 

aaagttggta tggcaactct aaaacaaatt ggttttgatg atgagattat tcaggatatt 13 80 

ctgcatgcag atgctccaac accattccca tccaatgaaa ctaatcctga aaaaatgcgt 1440 

cagtggcttg caactgttac taagattaac attggacaaa gaaccaatcc attccagcgt 1500 

tttggtctat cattgatgcc taatattgaa gttctaggaa ttgggtttac tccgattaat 1560 

gatatgacgc cagttttgca attcaaaaaa cttaaacagc tttggatgac aaatactggt 1620 

attacagatt attctttcct tgataagatg ccactattag aaggattgga tatttcacaa 168 0 

aacggtatta aggatttatc ttfcccttaca aaatataaac aattaagtct tattgctgca 1740 

gctaataatg gcattacttc gctaaaacca ttggctgaat taccaaacct tcaattctta 1800 

gttttgagtc ataacaatat ctctgacttg acaccgttat caaacttgac aaaactccaa 18 60 

gaattgcacc ttgatcataa taatgtgaag aatttaagtg cactttctgg taaaaaagat 1920 

ttgaaagttc tagatttatc aaacaataaa tccgcagact tatcgactct aaaaacaact 1980 

agtcttgaga cgcttctttt aaatgagacg aatacaagca atctaagttt cttaaagcaa 2 040 

aatcctaagg tatctaattt aacaattaat aatgcgaaat tagcatcatt agatggtatt 2100 

gaagagagcg atgaaattgt taaagtagaa gctgaaggga accaaattaa gtcactagta 2160 

ttgaaaaata aacaaggttc gttaaaattc ttgaatgtga ctaataatca gttaacatca 2220 

cttgaaggtg ttaataatta tacttcactt gagaccttaa gtgtttctaa aaataaactt 2280 

gaatctfctag atattaaaac acctaacaaa acagttacaa atcttgattt tagccataac 2340 

aatgttccaa catcacaatt gaaattgaat gaaaaaaata ttccfcgaagc agtagctaag 24 00 

aattttccag cagttgtaga aggttcaatg gttggaaatg gtagtcttgc tgaaaaagca 2460 

gctatggctt ctaaagagga caaacaagtt tcagataata ctaatcacca aaaaaacact 2520 

gaaaaatctg ctcaagcgaa tgctgatagt aaaaaagaaa accctaaaac acatgatgaa 2 5 80 

catcatgacc atgaagaaac agatcatgca catgtaggtc atcatcacca ttaa 2 634 

SeqID 53 

atgaataaac gcgtaaaaat cgttgcaaca cttggtcctg cggtagaatt ccgtggtggt 60 

aagaagtttg gtgagtctgg atactggggt gaaagccttg acgtagaagc ttcagcagaa 120 

aaaattgctc aattgattaa agaaggtgct aacgttttcc gtttcaactt ctcacatgga 180 

gatcatgctg agcaaggagc tcgtatggct actgttcgta aagcagaaga gattgcagga 240 

caaaaagttg gcttcctcct tgatactaaa ggacctgaaa ttcgtacaga actttttgaa 3 00 

gatggttcag atttccattc atatacaaca ggtacaaaat tacgtgttgc tactaagcaa 3 60 

ggtatcaaat caactccaga agtgattgca ttgaatgttg ctggtggact tgacatcttt 42 0 

gatgacgttg aagttggtaa gcaaatcctt gttgatgatg gtaaactagg tcttactgtg 480 

tttgcaaaag ataaagacac tcgtgaattt gaagtagttg ttgagaatga tggccttatt 540 

ggtaaacaaa aaggtgtaaa catcccttat actaaaattc ctttcccagc acttgcagaa 600 

cgcgataatg ctgatatccg ttttggactt gagcaaggac ttaactttat tgctatctca 660 

tttgtacgta ctgctaaaga tgttaatgaa gttcgtgcta tttgtgaaga aactggcaat 72 0 

ggacatgtta agttgtttgc taaaattgaa aatcaacaag gtatcgataa tattgatgag 78 0 

attatcgaag cagcagatgg tattatgatt gctcgtggtg atatgggtat cgaagttcca 84 0 

tttgaaatgg ttccagttta ccaaaaaatg atcattacta aagttaatgc agctggtaaa 9 00 

gcagttatta cagcaacaaa tatgcttgaa acaatgactg ataaaccacg tgcgactcgt 960 

tcagaagtat ctgatgtctt caatgctgtt attgatggta ctgatgctac aatgctttca 102 0 

ggtgagtcag ctaatggtaa atacccagtt gagtcagttc gtacaatggc tactattgat 108 0 

aaaaatgctc aaacattact caatgagtat ggtcgcttag actcatctgc attcccacgt 114 0 

aataacaaaa ctgatgttafc tgcatctgcg gttaaagatg caacacactc aatggatatc 1200 

aaacttgttg tgacaattac tgaaacaggt aatacagctc gtgccatttc taaattccgt 126 0 

ccagatgcag acattttggc tgttacattt gatgaaaaag tacaacgttc attgatgatt 13 2 0 

aactggggtg ttatccctgt ccttgcagac aaaccagcat cfcacagatga tatgtfctgag 13 8 0 

gttgcagaac gtgtagcact tgaagcagga cttgttgaat caggcgataa tatcgttatc 144 0 

gttgcaggtg ttcctgtagg tacaggtgga actaacacaa tgcgtgttcg tactgttaaa 1500 

taa 1503 

SeqID 54 

atgactgcaa ctaaacaaca taaaaaagtt atcctcgttg gtgatggtgc cgtaggttct 60 

tcttacgctt ttgcacttgt taaccaaggt attgcgcaag agttaggtat cattgaaatc 12 0 

ccagctttat ttgataaagc tgttggtgat gctgaagatc tttcacatgc ccttgcattt 18 0 

acatcaccta aaaaaatcta cgcagctact tatgcagact gcgcagatgc tgaccttgtt 24 0 

gtcattactg ctggcgcacc tcaaaaacca ggtgaaactc gacttgacct tgttggtaaa 300 

aacttagcaa ttaacaaatc aatcgtaact caagttgttg aatcaggttt caacggtatc 3 60 

ttcttagtag cggctaaccc tgtagatgta cttacatatt caacttggaa attctcaggt 420 

ttccctaagg aacgcgttat tggttcaggt acttcacttg actctgcacg tttccgtcaa 48 0 

gctttagcag ataaaattgg tgttgatgct cgttcagttc acgcatacat cafcgggtgaa 54 0 

cacggtgact cagaatttgc cgtttggtca catgctaacg ttgctggtgt ccaacttgaa 60 0 

caatggttac aagaaaatcg cgatattgat gaacaaggac ttgttgattt gtttatttca 66 0 

gttcgtgacg ctgcatactc aatcatcaac aaaaaaggtg ctacatatta cggtattgct 720 
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gtcgcacttg ctcgtattac taaagctatc cttgatgatg aaaatgcagt tcttccatta 78 0 

tctgtatatc aagaaggcca atacggtgat gttaaagatg tctttatcgg tcagcctgca 84 0 

atcgtaggtg cacatggtat cgttcgtcca gttaatatcc cattaaatga tgctgaactt 900 

caaaaaatgc aagcctcagc tgaacaatta aaagatatta ttgacgaagc ttggaaaaac 960 

ccagaattcc aagaagcatc aaaaaactaa 990 

SeqID 55 

atgcaagata aaaatttagt agatgttaat ctaactagfcg aaatgaaaac gagttttatc 60 

gattacgcca tgagtgtcat tgttgctcgt gcacttccag atgttagaga tggtttaaaa 12 0 

cctgttcatc gtcgtatttt atatggtatg aatgaactag gfcgtgacacc tgataaacct 18 0 

cataagaaat cagcacgtat tactggtgat gttatgggta aataccatcc acatggtgat 24 0 

tcatctattt acgaagcaat ggtgcgtatg gcacaatggt ggtcatatcg tcatatgctc 3 00 

gttgatggtc atggaaactt tggttcaatg gatggggatg gtgctgccgc acagcgttat 3 60 

acagaagcac gtatgagtaa aattgctctt gagatgcttc gtgatatcaa taaaaataca 42 0 

gttgattttc aagataacta tgatggcagt gaacgtgaac ctcttgtttt accagcgcgt 480 

ttccctaatt tattagtcaa tggtgcaact ggtattgctg taggtatggc aacaaatatt 54 0 

ccaccacata atctaggtga gtctatagat gctgtcaagt tggttatgga taaccctgat 600 

gttactacac gggagttaat ggaagttatt cctggtccag attttcctac tggtgccttg 66 0 

gtgatgggac gttcaggtat tcaccgtgcc tatgaaactg gaaagggatc aattgtctta 72 0 

cgttcacgga ctgaaattga aacaacttca aatgggaaag agcgtattgt tgtcacagag 78 0 

tttccatacg gtgttaataa aacaaaagta cacgagcata tfcgtacgttt agcacaagaa 84 0 

aagcgtattg aaggtattac tgctgttcgt gatgaatcaa gtcgtgaagg cgttcgtttt 900 

gttattgaag ttcgtcgtga tgcttcagca aatgtgattt taaataatct tttcaaatta 960 

actagcttgc agacaaattt tagcttcaat atgcttgcta ttgaaaaagg tgttccgaaa 102 0 

attttgtcat tacgacaaat cattgataac tatattgaac accaaaagga agttattgtc 10 8 0 

cgtcgtactc agtttgacaa agctaaagca gaagctcgtg ctcatatctt agaaggttta 114 0 

cttgtggcgc ttgatcactt ggatgaagtt atcacgatta ttcgtaatag tgagacagac 12 0 0 

accattgctc aagcagagtt gatgtcacgt tttgaattat cggagcgtca gagtcaagca 12 6 0 

attttagata tgcgtcttcg tcgtttaact ggattagaac gcgataaaat ccaatcagaa 1320 

tacaatgact tgctagcttt aattgctgat ttagcagata tcttagctaa gcctgaacgt 13 8 0 

gttgttacca ttataaaaga agagatggat gaggttaagc gtaagtatgc tgatgcccgt 144 0 

cgtacagaat taatgattgg tgaagtttta tcacttgaag atgaagacct tattgaggaa 15 0 0 

gaagatgttc ttattactct ctcaaacaag ggatatatca agcgtctcgc tcaagatgag 1560 

tttagagcac aaaaacgtgg aggacgcggt attcaaggga ctggagttaa taacgatgat 162 0 

tttgttcgtg aattggtttc aacaagtact cacgatactg tgcttttctt tacaaattta 168 0 

gggcgtgtgt atcgtttgaa agcatatgaa attccagaat atggacgtac agctaaaggc 174 0 

ctccctatcg ttaacctttt aaaacttgac gaaggtgaaa caattcaaac aattattaat 1800 

gctagaaaag aagatgtcgc taataaatat ttctttttca caactcagca aggtatagtg 18 6 0 

aaacgtacaa gtgtttctga atttagtaac attcgtcaaa atggacttcg tgctattaat 192 0 

ttgaaagaga atgacgagtt gattaacgtc ctcttgattg acgaaaatga agatgttatt 198 0 

atcgggacac gtacaggcta ctctgttcgc ttcaaagtga acgctgtacg taatatggga 204 0 

cgcacagcga ctggggtacg aggcgttaat cttcgtgaag gtgataaagt tgtgggagct 210 0 

tctcgtatcg ttaatggaca agaagtactg atcatcactg aaaaaggtta tggcaagaga 2160 

acagaagctt cggaatatcc tacaaagggt cgtggtggta aaggaataaa aaccgctaat 222 0 

attactgcta aaaatgggcc attagcacgc ttagtgacga ttaatggaaa tgaggacatc 22 8 0 

atggtcatta cagatacagg cgttattatt cgaaccaatg ttgctaatat ttctcaaact 234 0 

ggtcgttcta ctatgggtgt taaagtgatg agacttgatc aagaagctaa aatcgttact 2400 

gttgcgctag tggaacaaga aattgaagat aagtctaata tagaggatac aaaagaatag 2460 

SeqID 56 

atgacaatct ttgatgaaag agagttaaaa gaacgattta ctcatgaaaa tagggttagt 6 0 

ttttatgagt ttgtggctaa atatgatgct caaatggttc ctgtgatgaa agcaaagggc 12 0 

tatcgatgta ttcattcaat ggagcgtaca gtggttttta cttttgggga attcacgatt 180 

aggcgacgtc ggtggcaaaa aggagaacat tgggtggtgc cagttgatga aaagctagga 24 0 

ctgaaaaaga atgttcgcta ctctttagaa ttcatgtatc aaattgctag tttagcaacc 300 

atgatgcctt atgaaaaagt gattaaggtt gttcagatga tgtattgtat tgtgattacc 360 

aaacctactg ttgtgaaggc cgttaagatt agtcgtgaat tgcttaaaga aaaggaagcg 42 0 

tatcgttttt ttgatgaaga tataccagta gataaagaac cagttgatat gatttatctt 480 

gaaggagatg gagtcatggt caaagctcga gaagaaggat tagataatcg caatgttgat 54 0 

ttgtctcatt tcgtggttca tacgggtagt cagaaagtag gaagcaatcg ctfctgtctta 60 0 

caaaataaaa aagagtttgt gtcccttgat aatcgtcaaa cgcgtcaaaa gattttggac 660 

tacctttaca atcattttta cattgcccca aacaccttgc ttattaccaa ttcagatggt 72 0 

ggccatggct ataccccata tgtttttaaa gagattgcga aagcactcaa agtgaagcaa 78 0 

cacgaacact tttgggatag gtatcatgtc aatgaaaaga tcaaaagttt ttttaaactc 84 0 

tatccagtgg aactgatgac tggcgctttt cagagcatta aacagcatga taaagaaaag 900 
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cttagaactg ttttggatac gacagaagca ttgatactga tggaagaaga aatggaaggg 960 

tttaaccagt tcaaacgaaa gttgttaaac aattttcaat afcactaaatc agctgaattg 102 0 

agaggtttca gtcgtgcagg aataggtgtt atggagtcac aacatcgaaa aattacctat 108 0 

cggatgaaaa agcgcgggat gtattggaca attcagggag ctgagacgat gagtcaatta 114 0 

attgtcctat cgtatgaagg acagctaaga gatctcttct ttggttcttg gcgtgaagac 12 00 

tatcagaaat atcaagagct agaaaacctt agtgcgggaa aaatcaaaca tgaacagaat 126 0 

aaaatcaata agagatatga tctccagaca cttggtcggc ttaggtacgg taggcatcgfc 132 0 

aatttatag 1329 

SeqlD 57 

atggcatatt tatcaaaatt atccgatttg gatccatcgt tgatggatgc ggattctgaa 60 

caaatttata ttcctaaagt cttgtttgag cataatgact ttaaaggctt gacctacaaa 12 0 

gagattttat tatattcttt tttgttaaat cgtttaagag agccattaga ttttattcaa 18 0 

aaaggctatg atgataatga agatacctat gttcacttta aggtcgaaga tttatgtgaa 24 0 

ctactcaacc agagtaagac aaccgttatt tcctfcgaaga aaaggttagc tcaatatggfc 3 00 

ttgatagaag aagtgaaago aggtagtcac cagccgaatc gtatttattt aacagataaa 3 60 

ttagttccat atattaaggg gtaa 3 84 

SeqlD 58 

atgacagata atcgctttgc ccaattaaaa gaaaactttg aaaagggatc tcctaaaagg 60 

cgagttccaa cgtctcgccc aatcgcagct caaaaagcgc ctgagagcta taacaaaaag 120 

ggacggtatc cattttcgct ccaccaagat gtgcgttatg ataaattaga agcattagta 18 0 

gcttatcatg gagctaagtc tgcatcagat tatctggaga ggttgattgt tcaggaatgg 24 0 

gaaaagatgc agcggaagct taagaacaaa gaaaaataa 279 

SeqlD 59 

atgtttagtt ggttggaagc tctttattac actctgatac aactggcaaa agtgaaccgg 60 

ttgaatgctc ttttcttagt tagcgttgtg ggttatcttt gttaccaggg aataaaactc 120 

gtcagaaaaa ccataagaaa cttttttcag ctgatgaagg gtttcatagg tgatagagag 180 

aacatcaaga aatgcatcaa aaacaagaaa gaggcactag tccattcttg gaaacatcgt 24 0 

caagatattg attggaaatc aactggaaaa gataagagta aacagfctatg gaatcttatg 300 

aagcgacttg cgacagttgc tccttcattt ttgttcttat tattgggaaa tgttctcttt 360 

cgtctcattt atcaactgcc ttttgtaaag caagacagaa agcgatttga caaggaaatg 420 

aagcccttgc tctacttcaa gaactatcgt agttttgtgt tcatgggaat aggtttcagt 48 0 

ttcatagcgt ttattctcac aaactatttt gtgacggttt taagggctgc tattcgtttt 54 0 

ctatatttct caatcatgac gttaagagat aatagccaag tcgttagctt taacgttgat 600 

agtttgctca tccagaattt attcaatgct agggtatttg tgatagctcc cattctagca 660 

gtgccaatct ttctcattgg tttagtcgta gcttggcggt ctgcttgggt taactttgaa 72 0 

cagtaccgtg attataatca fcaatgaagaa ggggatgatc gctttgcgac tgtcaaagaa 78 0 

atccaccagc aatataagaa agttcctaat aaaacggaaa cfctatccagg tgaaggaggt 84 0 

gtgcccgttc ttcatgaaac aagaaagaat ttgacaggct taacgcttaa atctcaaatg 900 

ctttggcaaa accgtacctt tagtcgctat ttaacgaatg cggaaaggat tttagggctc 960 

ttatcgacgc cttcaggaga ttattacatc gatgatagca ccacaaactt gatcaccatg 102 0 

gggattactc ggtcaggtaa gggagaagct catattgccc ctattattga tattaatagc 1080 

cgtgcggaaa ttcaaccgtc actgattatt gcagacccta aaggggaaca ttaccagtcc 114 0 

tcttataaaa ccatgcgtcg gcgtggctat gatgttaatg tcctttcttt ccaaaacatg 120 0 

gattggtcca tgtcctataa ccctttagct cttgcgattg cagcagctaa gaagggfctac 1260 

tatgaaatga cacagacaag ggttaatgcg gttgcagaag ccatttatcg taaaacgaaa 132 0 

cctggtagcg gtaatggcaa tgcaaaatac tgggaagata cctccatttc cctctttaat 13 8 0 

gccattgcta tggccttaat ggaccgtgct aatgaaaccg tcaggaatgg tgaaaccgat 144 0 

gcttgggata ccgttacagt tcgtaacatt gccaagtttt tgactgactt gggttctgaa 1500 

gaagtctttg tcaatgattt tggagagatt gttgagaatc ctgataagaa ccaacaagtg 1560 

aagaagaaat ctaaaatcac ggtttacttt gataacttgc gtaaaatcaa tcaagaacaa 1620 

ttttccaaat ttagagatat ggctgattta aactttaggt cttctgactt cgcttcagaa 1680 

gaaactaagg gaaatgtctt ctctagcatg atgtcaggta ttaacttatt cttgcaagat 1740 

aatattgcta aactaacctc taaaaactct attgacctag aatcggttgg tttcccacgt 1800 

cgcttgtcta tcaagtttcg ttctagttcc aatgtcgcta tgcgfcaacga atacactcat 1860 

aagacggcta aggttaccat tactagtcaa gctgtttggg gtaaaaccac taaacaagtt 1920 

atccacgtag atgctgcaac agctctgatfc gatggtgaag gctatctaac ctatgtgatt 1980 

gaaccccagc ttcctgatca attcttggta acaattgact ttaatcacga aaacaatggt 2040 

ggttcagcta ttcgtcacaa aactttccaa ttctcagctg agaaagtcta taagaaacgt 2100 

ggtaacgtta ttacgttgga tgactacacg aaaaaaccag ttttggatca tatcaaagtt 2160 

actgttctca acaagcaaga tgataacctt ctccagaaag aagatattga cctgatttat 2220 

tcagataatc ctaaagtgat ttacttggta acacctccaa ataggactga atataatagt 22 80 

attgtatctc tgtttttgga tcaattgttt aatgccaatt atgagttagc tctgtcaaat 2340 
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ggtcgcaagt gtgtcaatcg aattcttcat atcctcgatg aattcacaaa cattccagct 24 0 0 

attcctcaca tggataccaa gatttccatt ggtcttggtc aaaatattct ctactatctc 246 0 

tggattcaga acttgaaaca gttagtcagt gaatatggcg agaatacagc ggaaaccatt 252 0 

cgggagaact gttctttgaa agtttatatc aaatctactg ccccagcgac caacgagtac 2580 

ttcagtaaag agttggggac tcggaccatt acacgtcgca gaaggtcaag taatattcta 264 0 

gatgaagcta atccaaatgt ttccattgaa aatcctagac aggaactctt aacaccgaca 2700 

cagctctcga aactccaaga aggggaagcg gttattttgc gtggtgttaa aggtcgagac 2760 

aatgcaggtc ggaaaatcac aacggatccg attttcttgc atgagaaaac gagccttcct 282 0 

tatcgctaca tgttcttaca agaagaattt gaccagtcga tggctttggc agatattcca 28 8 0 

gtggaaagtg ggcataggga ccttgacctg caagatatag cagtaggggc acaaagcact 294 0 

tttaataaga ttattgattg gcggatggct ctaactgacc gtatgagaac aaatgggaag 3 00 0 

atacctcaat tagcatcaag aaaacaaacg attaaagctc taagtcaatc tcaatttact 3 06 0 

tctccagcag acctaacaca agctgtgatt gcagaggtat fctgatgagga agatgatgac 312 0 

gatcttttct ttgtggatga tgtcatgtaa 3150 

SeqID 60 

atgaattcta acacaaaagg tcacggattt ttccgcaagt caaaagcata cggcttagta 60 

tgtgctattg cattagcagg tgcatttaca ttagctacta gtcaagtgtc fcgctgatcaa 120 

gttacaactc aagcaacaac tcaaacagta acgcaaaatc aagcagaaac agtaacatca 180 

actcaacttg ataaagcagt agctacagct aaaaaagcag ctgtagctgt tacaaccaca 240 

cctgcagtta atcatgcgac aactactgat gcacaagctg atttagctaa tcaaacacaa 3 00 

gctgttaaag atgttactgc aaaagcacaa gctaatacac aagctattaa agatgctact 3 60 

gctgaaaatg caaaaattga tgctgaaaac aaagcagagg cagagcgtgt tgcaaaagaa 420 

aacaaggaag gtcaagcagc cgtagatgca cgtaacaaag caggtcaagc agccgtagat 4 80 

gcacgtaata aagcgaaaca gcaagcgcaa gacgatcaaa aagcaaaaat tgatgctgaa 540 

aacaaagcag agtctcaacg tgtaagtcag ttaaatgcac aaaataaagc aaaaattgac 600 

gcagaaaata aagatgcgca agctaaagca aatgcgacta atgcacaatt acaaaaagat 660 

tatcaagcta aattagcgga aatcaaatca gttgaagctt ataatgcagg tgtacgtcaa 720 

cgtaataaag atgcacaagc taaagcagat gcgactaacg cacagttaca aaaagactat 78 0 

caagctaaat tagcacttta taatcaagct ctaaaagcta aagcagaagc agataaacag 840 

tctattaata atgttgcttt tgatatcaaa gctcaagcta aaggtgttga taacgctgaa 900 

tatggaaact caatcatgac tgcaaaaact aaacctgacg gaagtttcga gtttaaccat 96 0 

gatatgatcg atggtgtgaa gacaatcggc tatgggaaat tgacaggtaa agttaatcat 1020 

cattatgttg ctaacaagga fcggctctgtg acagcatttg ttgattctgt cactctttac 108 0 

aagtacgagt atcgtaatgt tgctcaaaat gctgctgtta accaaaatat tgtatttaga 114 0 

gttttaacaa aagatggtcg tcctattttt gaaaaagctc ataatggtaa caaaactttt 120 0 

gcagaaactt taaacaaaac tttacaactc aatcttaaat atgagcttaa accacatgct 126 0 

tccagcggta acgtcgaagt ctttaagatt catgatgact gggtacatga cacacatggg 132 0 

tctgctttag tgtcttatgt taataataat gatgctgttc ctaatgtggt catcccagaa 13 8 0 

cggccaactc caccaaagcc agtgaaagtt acacctgaag cagaaaaacc agtacctgaa 144 0 

aagccagttg agcctaaatt ggtaacgcct acattaaaaa cttatactcc agtcaaattt 150 0 

attccgcgag aatacaaacc agaaccaatc acccctgaga cgtttacccc tgagaaattt 156 0 

actccagctc aaccaaaagt gaaaccacat gtgtctattc ctgaaaagat taactactca 162 0 

gttagtgfctc atcctgtttt agttccagct gctaatcctt caaaagctgt cattgatgaa 1680 

gcaggtcaat ctgttaatgg taaaaccgta ttaccaaatg caacattaga ctatgttgct 174 0 

aaacaaaact ttagtcaata caaaggtatt aaagcttctg cagaagcgat cgcaaaaggt 180 0 

tttgcatttg tagatcaacc aaatgaagcg ttagctgaat tgactgttaa gtctatcaaa 186 0 

gcatctaatg gtgatgatgt atcaagcttg ttagaaatgc gtcatgtttt atcaaaagat 192 0 

actttagacc aaaaacttca atctcttatt aaagaggcag gaattagtcc agttggtgag 1980 

ttttacatgt ggactgcaaa agatccacaa gctttttata aagcttatgt tcaaaaagga 204 0 

ctagatatca cttataatct atcctttaaa atcaaagcta actttactaa aggtcaaatc 210 0 

aaaaatggtg ttgcacagat tgattttggg aatggatata caggtaatat tgtagtcaat 216 0 

gatgttactg ttccagaagt acataaggat atacttgata aagaggatgg taaatcaatt 222 0 

aataatagta cggttaagtt aggtgatgaa gtgacctaca aacttgaagg atgggttgta 22 8 0 

ccagcaaacc gtggttacga tctttttgaa tacaaatttg tggatcaatt acaacacaca 234 0 

catgatcttt acttacgtga taaagtggtc gctaaagttg atgtgacatt aaaagatggt 24 0 0 

acagtcatta aaaaagggac taatttagga gagtacacag aaacagttta caacaaaacg 24 6 0 

acaggtaatt atgagcttgc ctttaaaaaa gagtttttag ctaaggtttc tcgtgaatca 252 0 

gaatttggtg cagatgattt tattgtagtt aaacgtatta aggcaggtga tgtttacaat 258 0 

actgctgatt tatatgttaa tggatataaa gttaagtcag aagcagttgt gactcatact 264 0 

actgagaaat caaaaccagt tgaaccacaa aaagcaactc caaaagctcc agctaaagga 270 0 

ctgccatcaa ctggtgaagc tagtatgacg ccacttactg caattggagc aattatctta 2760 

tcagctctag gcctcgcagg ctttaaaaag cgtcaaaaat aa 28 02 



SeqID 61 
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atgaaacaaa taaaaattat cacaggactt acagtcgcaa cactatccgc agtggtaggg 60 

aatgtatacg cagaagatat cacaccgaca gcaccagtta atgagccaca agtttcaagc 120 

gaaaccgcaa aaacgcctca agttacagaa agtcaagtta atagcgcaaa agttactgcc 18 0 

gatcaagcaa taagtgatgt taataaccaa caaattgtag ttgatgaggc tcaaaacjcag 24 0 

aaagatcaat cacaacaaaa- tcttgtaaaa gccacatcaa cagtaactga agctga^aaa 3 00 

gtcgctgctg aggcaacacc tgaagttgtt aaagaagcta ttaaagctgt cactgaagca 3 60 

aaggaagctg ttacggatgc tgaagctaat gtagtcgatg cacaaaagac ggaacaaaaa 42 0 

gctaatcaag aagtacaatc gcaagctaag actgttgatg aaaatgttaa agttgttgct 48 0 

gataaagaat ccgaagtgaa acaagcagag ggtgttgtaa ctacagctca agaagctatt 54 0 

gatagcaaaa ccgctaacac taatgcatca gaagcagaaa aagcagtgac tgagaagcaa 600 

acaaagcttg aaactgctga aacaaatctt aeagaggcac aaaaacaaga tgccaaaatc 66 0 

gctgaagaaa aacgacttgc agaacaagaa gttgttaata aacaattagc tgtgacagat 720 

acacaaactc ttttgaagaa gcttgttact gagattaata acgagaaagt aagcactagt 78 0 

ttagaaaacc aagcctattt taaccaacgt gacggttcat gggcgggcta ttatggcaat 84 0 

tatacttttg ccgctactgg ctgtgtacca agtagtttag caatggtatt tacggagtta 900 

gctagacgcc aaat tact cc aactgaaatc gccaattatc tttggaacaa ttcaaatgag 96 0 

tttaataaaa acfcatggtgg aacaagtggc aaagggttag tacaagctac taaacatttc 102 0 

ggttttgtac caactcattt agcatctcaa tctgctattg tggaagcctt acaggcagga 108 0 

caccatgtcc tagcagctgt ccaacaagat aaattttctc cttggggtat caattacagc 114 0 

catgagattg tcttacgtgg ttactcaaat ggcaatactt atgtttacga tccttataat 120 0 

cgagctaaca ttgggtggta ccctgttgct aatttatgga atgaacaaag tagagacgca 1260 

attgacacat ctagtgtagg cgtaccattt ttcaaaatta caacacaaaa aatggcccaa 13 2 0 

ttagaagctc agaaagcaca agttcaatct tctctaaata cagctaaaaa tcagttagct 13 8 0 

aaaacacaag acgtattaag aacactagaa gcaacacctc ttaaaacacc agaagcacaa 144 0 

gctaagctta atcaagccaa agaagctcta gctcttgcac aagcagacta tactaaagcg 15 0 0 

caagaagctg ttaagttagc tagtcaagac ttagctgtta aagaagaaac acttaaaaat 15 6 0 

gctcaggctg atttattaac gaaacaaaca gccttgaaag atgctcaaac tgttctcgtg 162 0 

gctagtcaag ttaaattagc tgafccttaaa gcaactttag ctactgttga aaataacgtt 168 0 

aagaaagctc aagctacttt aacagatgcc aaagcaattg ttggtcaaaa acaagcaaaa 174 0 

ttacttgctt tacaaaatgc ccctaaaatt ctagcagatg ctcaagctaa acttgtaact 18 0 0 

gctaaaaatg atttagctaa taaaatggct attttagatg aagcagtcgc aaaactaaaa 18 6 0 

tccttacaag ccgttcaagc cgaagcccaa aaacaatacc atgttgtttt tgaagcctat 192 0 

aaagcagtac gagatgccaa agaacaagct aaacttgcag aaagctataa ccatattatc 198 0 

gcacgaggcg gtgaggctat cccagttgtt gacgaaacag ataaaataac tggatatgta 2 04 0 

gatggtagcc agaaagcagt tgctaatgaa gtgactcttg ccttaactag taacggtgca 210 0 

ccccttgaaa gtccagttaa taaagaaaac caaaacgtaa caaaatcatc tcaagcttta 216 0 

ccacatactg gtgaagctgg attatccatt ttatcagtat taggtgtagg tcttatctca 222 0 

actttagggc ttactagttt gaaaaaacgt cgcccacatt aa 22 62 

SeqID 62 

gtgagattaa cccacgtgag tttgtggatg ccatcaaaga gaagttggcc gctcaaattg 6 0 

cagttgttaa ggaaaggagt gattgagttg gaagatgaac tgaagacccc ttatcttgat 12 0 

caatacacag acaatttaac ggctaaagtc actaaaaaat cggatgacta ccaagtttat 18 0 

ggacgaaaca aggaggttca atccgtcatc atctctcttc tcagacgaac caaaaataac 240 

cctattttag taggagaagc aggggttggt aaatcagcca ttgtggaagg tataacgctt 300 

gctatcttac gtggccaagt tcctgaacct ttaaaaggtt taacggttcg ttccctggag 3 60 

ttgtctagtt tgatgagtga agatgacgaa ggctttattg ctaagtttaa gaagattatt 42 0 

gaagagatgg ttgctacacg tggtcataat cttctctttg tagatgaatt tcatacgatt 48 0 

attggtgcgg gtagccagaa tggtcaagcc cfctgatgcag gaaatgtgat taaacctgtc 54 0 

ttagcacgtg gtgatattca gctgattgga gcaaccacct tagatgagtt ccatgagtat 60 0 

attgaaacag atagagcctt ggaacgtcgg atgcagcctg ttatggttga agagccaacg 66 0 

atttcacaag ctattaccat tattgaacaa gctaaagtca tttatgagaa gtttcatggg 72 0 

attcaaattt cctcagatgc tgtgcatcaa gcaatccgtt tatctgttcg ctatttgaca 780 

gatcgattct tgccggataa ggcctttgat ttgatcgatg aagcggcgac gattgcttca 84 0 

gttgaaggga aaagtaaggt gacagaaaaa gatattgctc aagttttaaa agataaaacg 90 0 

ggaattccag tcactactat cctaaaggga gatcaagagc ggttagaggg tttcaaagaa 960 

aggctgatga atcgagtcaa aggtcaagaa gatgccattg aggccgttgt agatgcggta 102 0 

acgattgctc aagctggttt acaaaatgaa aaaaggccgc ttgcatcatt ccttttcctc 108 0 

ggaccaactg gcgttgggaa aacagaatta gccaaagcaa ttgcagaagc actttttgat 114 0 

gatgaagctg ccatgattcg ttttgatatg tctgagtaca aacaaaaaga agatgtgact 12 0 0 

aaactcatcg gcaatcgtgc gacaagaata aaaggacaat tgactgaagg agtaaaacag 126 0 

aagccttatt gtgtcctgtt actagatgag attgaaaaag cacacagtga ggtaatggat 132 0 

cttttcttgc aagtgctaga tgatggtcgt ttaacagata gttcgggtcg tttgattagc 1380 

tttaaaaaca ccattgtgat tatgaccacc aatattggcg ctaaaaaaat catcaataag 1440 

tgggagttga aaggaaactt taaagattta accgatcgag atcggaaaca atttgaaaag 1500 
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tcgatggaca gtgagcttca aaatgagttt cgtccagaat ttctcaatag gattgaaaat 1560 

aagctcatct tcaacctttt agaacgtgat gtgattgaaa aaattgcaga aaaaaatcta 162 0 

tctgagattg cggataggat gaagcgccaa aatttaacac tgtcatatga gcctagtctg 16 8 0 

attcagtatc tctctgatgt tggtactgat gtgaaaaatg gcgcacgtcc cttagaacga 174 0 

ctgatgaaac gaaaagfcgct ggcacccatt tctgtaaaaa gcttacagtt agataagtca 18 0 0 

aagcaaggct ataacgttca tctctgggtt gagggacggg ctccagacgg caatcatcgt 18 6 0 

caagaacaac gtcaaattca catggagata gagggagaaa gagafcaactt ttttagctga 192 0 

SeqID 63 

atggcagaag aaaatgcaca acagccatct ttacgtggaa aaagccgtcg agaacgtgtt 60 

gaatttgctc gttcccgtga tattttagat gtggcccatg aattgaatat ggaacttttt 12 0 

cgagatggca aaaattatcg ttggaaagag cacgattcta tggtaattac acctgcaact 18 0 

aatcagtggt actggttttc acaacgtcaa ggaggtgatg tgattgcgct tgtagagacg 240 

ataaaagaaa tcggcttcaa tcaagcattt gaatacctga atgaaggtac atttaaagaa 3 00 

tttactgtgg tcaatcaagt aaaagaacct ttttcttatt atttagaacc ttacgaacag 3 60 

ctttttgttg aagcaagacg ttatttgaaa gagaatcgtg gcttatcaga tgacactatt 420 

gatttttttt atgataaagg agtacttgct caagccaatg ccaaggtagg tgatatgatt 48 0 

gagcctgttt tggtgtttaa aaacttggat aagaacggcc aagtggttgg cgctgctcta 540 

caaggcttag tagctgcccc cgataaatat ttcggtcggg gttacctcaa acaaatcatg 6 00 

aaaaattccc aaccttacaa cggcatgcat gttgatattg gcacaccaaa ccgcctagtc 6 60 

tttgcggaaa gtagtattga tcttatgagt tactatgaaa tccacaaaga tagcttatca 720 

gatgttcgtt tagtttccct agaaggcttg aaaacaggaa caataggaag gcatctcatc 780 

caattaagag ctgaaatgga acgtcgtccc ctctcctcca gttggactga tgaaatactt 84 0 

gctcagggat tagatgaggc tgtaaagcaa ggctatttca aagatggtaa gaacagtcat 900 

ctccttacgc tagctgfcaga taatgatgta aaaggtaaac aattgattga agagttaaaa 9 60 

gataagagca tcccagtcat tgatgccact cctcctaaag cagaaggtca atctaaaatg 102 0 

gattggaatg cctatttgca ggaaaccaag gctacttttt ctactgaaaa atatcaagaa 1080 

aagattgatc acctaatatc agatgttatt ttaggtgatg aaacttatta tctttggcat 114 0 

gatgatgagt tggtaaattt aggagcagga gactctatta tacaggcgtt tcatcatcag 1200 

ctggaagaca gacgttatgt tattaatcag gcagaacttt acgttgaaga gtccagtaat 12 60 

gatggggcta ctggctatct atcaattgaa ggacatgtat tagataagga tggtattagt 1320 

gactatttat ctgatcaagc tttaacagat gcggaaaaag tagcattctt agaaacttta 13 8 0 

cagacagaac taccagatat ttgggacgag atagtcaatc attatgacaa ggtcfcttgaa 144 0 

gaagttgtcg ttaaatatgg actcagagaa aaacatgcgg acataattca ggaacaagag 15 00 

tfcggatttag accctttagt tgttccagaa gcaaaagaaa agtcccttga aatgaatcaa 1560 

gagactaaca ctggcggcga actatttaat cgcaattcca gttttttagg agaagattct 162 0 

ccggggacag caccgcagcc cgttgagcca actgctcaac ctgattttcc taccaatgtt 1680 

cgcttacatt ttaccactga tgatggaaat atgtcaaata aagcatttag gaaaaacatg 1740 

cgaaccttga atttatatgc taatacgatg agagattcgg cacaatggta cttatcagaa 18 00 

atagccgata ctacaatgag ttatgtttat aaaactcctc atgaagaggg agtacaggtt 18 60 

cttagtgtgc attttggtaa aaagaattgg atgcacttaa caggagtgac accagtatat 192 0 

gaaaattggg tagattcact ttctgaacaa tttattgatg atattgctaa tagtaaggga 19 80 

cattttaaaa atctaaagtt tgcactaggg actcctgata agttaaaggt tcttaactta 2 040 

cttcctgaaa ttattgaatc agataccttt gtatttaatg atttatcatc tgttcaaaag 2100 

ttaaataatc ttgatttatc acaggcgctt aatcctgaag atagcgattt acttttgcta 2160 

tttagagatg aggggcttca tcaagtgcct gcttcgttga tgaggataaa gggcgattta 2220 

gaagaaagat tatctcatat agatagtggg acagtgttag gagtttaccg agaaagaaat 22 8 0 

ggccaacttg agcaagtttc tgtcaatgag gagtatgtca aagatagtgg tcaagaaatg 234 0 

ctatctattt tacagaataa gcactatgaa gaggctcttg atagtggtca agaaatggtt 2400 

caaacggatg gtttttcagc cgaagatttc acaaaagttt tggatgctgt ctatcatgtt 246 0 

ggtgttccag atgatttagc acgtgttcca gaaggagtct tacctgtttg gcaaaaatat 2 52 0 

cttgaggtat cagaagagaa tcaatgggac ttagaacaaa tgattgatta tgcggataag 258 0 

aacagtcttt tagtaaaaga ttctgccttc tataaagagt ggaaagagga tatgatctat 264 0 

aaaaatgact atcatgttcg cttacaattt gctgagaatt gggataatgg cgttgaactg 270 0 

cctttccgta cagaacaatt gattgattac aaaacgtttg tcacaggact ttatgaagct 2 760 

aatcaagctc atcatcaacg cagacaagaa agtcagcttc cttatacaaa aacagagttt 282 0 

gacatttatg ctcctggcgg acagctcatt aaagataatg tccattatgc tattggagat 28 8 0 

gaaacaagac ctgtttcaca actcatgggg ttaggttatc gtcgcctacc tggctatcag 294 0 

gaattagctg tgatagataa cagtattctt tctcagctgg aaaataagga gctaaatcaa 3 00 0 

gaaattgctt ctgaagctaa tgaacattcc ttaaattcac aagaaatacc taaagaagat 3 060 

aactatccaa gagaagcttt cacttctcct aaacaagaca tcaaaaaggg gctagctcaa 312 0 

cgagtagaag agattgtggc agaagatgct actaagattt tagtatcttc cattcctcaa 318 0 

gttcaagaga atttatcagt cgaaggaaac ttggtcggta ctccacaagc ggataatcgg 324 0 

atgctataca ctaacctaga agattttggc caagattatc aattagagct tgcagtttat 33 0 0 

agtccaaaac gggttgactt tttagaagat gtgcaggctc cgtggacctt ggccctaatt 33 6 0 
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agaaaagaaa 

attgaagaag 

ctctataaac 

caggaaccaa 

gagatcagcg 

cttgagtaca 

atccatgaac 

ggagaagctc 

aagcggacag 

aaatcgaaaa 

aatggtcaac 

tcacttcaag 

agagatt caa 

actactctga 

gfcggataaag 

ggggttagcc 

tcagaagaac 

accattcatg 

gagttgccga 

cactttggcc 

ctaaaagcgt 

tcgaaaatgt 

gaacacggtc 



aaaagatagg 

agctagaaca 

aagctgaagt 

tacctactat 

aacacgcttt 

tggatttcat 

aatggcgtgg 

taggtattaa 

gagagacgaa 

ttacgctctt 

agctgaaaaa 

agaaagcttt 

aaactggtca 

agccaggaag 

tcaagactaa 

taatgaaaga 

agctgattct 

agttagcaca 

aaggacaaaa 

tagatacgtc 

tagaggataa 

tgaagcaagt 

taaattttcc 



ttatttagcg 

tttagctgca 

agaggctttt 

tgtagctgaa 

ccaaaagatt 

gagtaagttc 

cgcaaatgca 

accagacgat 

ggaagtcgtt 

tagaccgcta 

tgataaaggt 

agtaaaagac 

accaaggttt 

ctatcctaag 

agaggtatta 

tgatgctcat 

tattaatcct 

tgcaaccctt 

agaatttgaa 

tgaaaaggca 

acaattggca 

tgaaaaatat 

taaagcccca 
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tatggtagtg 

caaattggag 

atagctagtc 

ccgtttgact 

cgtgagtaca 

ccacaacttt 

gtcgcaactt 

gttgtccaaa 

catcaagggt 

atggttaaaa 

aatccaaagt 

ggtaaacttc 

acgacttaca 

gctatgccaa 

gaagggcttt 

gttttggata 

aataatacac 

cataatccta 

gctgagatga 

atccattata 

gactctctga 

acgaaccctc 

accaaagggt 



attgggcaaa 

acgaaaaagt 

atcagggaaa 

ataccagtgc 

cacaatcccc 

cacctcgaaa 

acgaacaatg 

caaaagcaac 

tatctgttaa 

tgatcccagt 

ataagaagct 

cagttcgtca 

aagtttttga 

accgtcattt 

gtgactatgc 

atgccaaagg 

ctggagaaaa 

agttggaaaa 

ccagttactt 

tggcaagttg 

aacgagttca 

accagctagg 

ctagtcgtta 



agaatttcag 

accagaggga 

tgaaagcctt 

ctccgcctat 

tgaggacttg 

tgtcgctctt 

gaaagccatg 

ttatgttaat 

gactggcgaa 

tcttgatgag 

atcagaagcg 

atttcaggag 

gttgtcacaa 

caactttaat 

ggaaaagatt 

agctttttat 

aattgcaact 

acagtataaa 

gttgtcaaaa 

gacagataat 

tcaaacggtt 

aagggggaaa 

a 



3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4731 



SeqID 64 

atggatgtat 

caatatgtgg 

gaaacgaaca 

gctgtccctg 

gcaactgaag 

cagaaactac 

cttctctggc 

gatgttgcga 

ccaacactta 

ctgaatacag 

aaaatattct 

atcaaccggt 

cttctcagaa 

cagaaggaca 

ttgccgctaa 

ttctttcttg 

gaatttttga 

aaaaataaaa 

gaatcaaaag 

cgtttagcca 

cagctggaaa 

catcctgaag 

aaggctggag 

ggtaaaggta 

acagaaaata 

gagttaaaag 

cgtgaattag 

gaattgcagg 

tttttagatt 

actcctaatt 

tttgaagata 

attaggcaac 

acgcttctga 

atctctattt 

tcggatgaag 

aaacgtctat 

gtttctaaac 

agaagcggaa 

caacaagcaa 

gaagaacgta 



cgtctagtcc 

actatactaa 

ggcaaatgat 

aaacgcagtt 

aacaatctaa 

gattaatcga 

aaggagttat 

ctggtcaagt 

tccagaaaga 

ataatatcca 

atcagccacg 

ttaaaagtgg 

aagaacagat 

agattacttc 

ataagaagtg 

atcgctattt 

aagagactag 

tctatgaaaa 

gatatgattt 

ataatatctt 

aaaatttaga 

caagtgtggt 

agcagccact 

taggacggcc 

ttcagctgaa 

agttagtaga 

gtacctatcg 

ttcgtcaaaa 

ttaagaaaca 

acaagctatc 

gtgttgcgct 

tfctggactga 

aagggacatc 

ttcagttgaa 

caacaataaa 

atgatacatt 

aattacaaga 

aaatcaatac 

gtaaaaaagc 

ggcaagctca 



gaatattaca 

cagagaagag 

tgaaggatta 

gaattttagg 

agaactaaca 

tctaaaaaat 

ttcgtttgat 

tgatcaaaaa 

gggcctttct 

tatccatttt 

tggacgtatg 

tgtgtatcat 

tttagctaat 

ttcagctgaa 

gcgctatggt 

agattcctat 

agattttctt 

actacgtaaa 

agaacatcat 

acgttcgttt 

gagtttttct 

aaaaagtcag 

tgaaattata 

agaatttgta 

aagtctaacc 

tgctgctaag 

ttatgcgttg 

gctactagaa 

gttaatagct 

agaagatgat 

acctgtttca 

gctaggattg 

aacgacgaaa 

atatcagatt 

ggagatgagg 

acagccatca 

acgaaaagtt 

agacttcatg 

attgatggaa 

acgttaa 



tttatgctgc 

gctgtcaaaa 

actgaagacg 

gaatacattg 

gctatcttta 

aagttggaat 

aatgcttttc 

gcgattaagg 

gattctgctt 

gggctttctg 

gagtacaaag 

ggattgctga 

ctcaaagcgg 

aaaaattttt 

tctaatgcca 

ttaaacaatg 

cagacttatg 

gttgatgggc 

ttggcacgtc 

agagaagctg 

gttttgaacc 

aaagcttggc 

aggccagtct 

tcagatactg 

ttgaaagacc 

ttaaagacta 

aaacttagca 

caggtacagc 

caggaattac 

aaagccttga 

aaagctactc 

gttcatcaca 

caagcttata 

aacaatagaa 

atagctaatg 

gatgatggtc 

atcaaaaaag 

cgacaattga 

cgtgcacgta 



aatacacaga 

ttgatgaaga 

agttgactcg 

attatatgaa 

ctcaagaagc 

cage t tat ca 

ttgeggaaca 

cagtgatgcg 

tttggtgggg 

aagttgaatc 

gtaacttctc 

aagaagaaac 

acttcataac 

tggaacaagc 

gagattttgc 

aaggtagtgc 

aaggggttta 

aaacgatcag 

gtgtaatgga 

caccccaaat 

agaagaaaat 

agaagttagg 

ataaatctta 

tttatgatat 

tctctctgtt 

atccaacaga 

tattagaatc 

cactagcgtc 

aagctatagc 

aaaatcgatt 

ctggtgccat 

ttcaagatga 

tagaagaact 

acaagcagat 

ctaaaggttt 

aaaatcagat 

ctcaattaca 

cagcttctct 

gtgatgaacg 



ggctaatcct 

attgtcctta 

tattcaggaa 

ccgctcgtat 

agattatctt 

aaatggttca 

gggattgtat 

tgatatgatg 

gaatatccat 

taaccgtgag 

tcagaaaacc 

aagatccaat 

atctatttac 

ctacaatcat 

ggttagtaag 

tgcctatcaa 

ttcagctgaa 

aacgettgea 

tttaagagag 

teaagaegtt 

tttagaacaa 

ctactttgtg 

tgataagcat 

tagtcagcta 

ttcttctaac 

gagagaaege 

tagtcagaag 

tgatcaacca 

gctacaattg 

gaagaggcag 

acaacttcct 

aaacattcta 

tcagactcat 

aagccagtta 

ctctgagcta 

tagtcaagct 

acagaegcag 

taatcgttca 

tgaggaacaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2367 



WO 2004/099242 



28/109 



PCT/EP2004/004856 



SeqID 65 i r 

atgaagaaga ataaatttct attagtcagc attgtattta tcatcatttt tgtggtacag 
ccacaaaatt ttcaatcttt aaaaaatata tttactcaaa atgatatagc tagtcaattg 
aatatatcaa gttcacctga agaaaaaaat gatggcttag gaacggccta tcaaacgcaa 
aatgaagatt tgaaatctaa aagttttgat ggtcaacatc aagtcatagt ggttaatgaa 
aaagcacaat ttacggcaga agagttgagc atgagaaatg gttcatggga aaaatacgat 3 00 
aacctagatt ttttaaatcg tgttggggtc gcagaagcta tgttaggaaa agagttaatg 3 60 
ccaaaagagg cacgtcagga catttcttca gtcaaaccaa ctggctggaa aaataaaaaa 420 
attaccttca atggaaagca agattatctc tataatcgtt ctcacttaat tggatttcag 480 
ttaagtggtg agaatgccaa tgtgaaaaat ctctttacag gaacacgtgc ccttaatgca 540 
aactttaacg atgataaatc gtccatggtg tactatgaaa atgaggtcgc aaactacatt 
aagaaaacaa atcatcatgt tcgttatcga gtgaccccac ttttcaaaaa tgtggagtta 
gtagctcgtg gcgtacgtat agaagcacaa agtattgaag atgagaccat ctcatttgat 
gtctatattt ttaatgggca accaggatat gacattgatt atttgacggg atcatctgaa 78 0 
aaaataatga ttacaaaatg a 801 



60 
120 
180 
240 



600 
660 
720 



SeqID 66 

gtgactaaag aaattaaaat tagatccatt ccagaaaaaa catgggcaca gctacatatg 

atttcggaag aatatgagta tccgtctttt aatgaattta tgcttgctca attgcaacgg 12 0 

attgtcgaaa atggtggtct tgatttatat gataataaat ttgcggaaac cttggcagtt 18 0 

attaaagaac aacaagcaca aatattggat cagctcctaa aaaacgaaat caagttactg 240 

gcttatcatg caaagcaaga tatagtggaa gagctaacaa ccgactggtt acgatttatg 3 00 

gatgatgttg atgcgctggc tgcggaaaga ggagcaggag gacgttga 348 



60 



60 



SeqID 67 

ttgttatatt attttattta cttgataaaa gtaattggaa atggccttaa attatcactg 

atttgtggtt taaattggct catcaaaata gtattcaaag gacaatttta tcttttctca 12 0 

gctgtctttt gtggcttgtt gacctactat atgccacaag atattcaatt atttacagtg 18 0 

agagttttag agctaatcat catgctaaaa gtgataatag atgttaccca tacagctctt 24 0 

tcaagagatt ttaaacggat gaaaacgcct ctttttttag gagtgatgta cgtcttcttt 3 00 

ctagctggta acagttatat taaagcacat cttttgacag aagttatggt taatcatctt 3 60 

atttcgtttt ggcttatcag cctgtttttt gccactcttg taatagttat tcagcccaga 420 

ttatttaagc attatctgtt aaaaaaagtc attgataagg aatacttggg tatccgaaaa 48 0 

ttcacagact ctcttcctcc agaaatcaat ttgtacaagg atgcggacga agaagatgcg 540 

gacaaacgga tgcgactgat caaccagaat gtcatcaaac acccctatca agaagttgtt 6 00 

gaactgagct ttttgaatag agaagtgata acagctattg gctataaagc cgttccattt 660 

gaaaaagaaa ctgaacgtac ttttatagac gatgatacca tctattatcc catttttacg 720 

gttcaccctt tcagaaattt ggaagggaaa tcagattttt atcacatact aatgaaactt 780 

aaactcagtc gaaaagcggc ctttacaaaa aatggtgagc gattattaat tagagatttt 840 

taa ~ ~ ~ 843 

SeqID 68 

atgattagaa atgaatttta taatcagtta atcaatagtg agccaatagg ttttattgat 60 

cctttcactg acttaggaga atttgattct attcagatga agttcaaaca acctgttaga 120 

aatctggtaa ataagtactc tggtaaacct tataatctta gttggcaaaa taagattgaa 180 

cagatgagag tgctatatat taaatatcag aaaagcttga agctagaaga fcgaagaacaa 240 

gaggttcata accgagttaa aaataaaaag tctaaaaaat atgttcatga aatcgttaca 3 00 

acatatttga agttaggatt caggtttaaa gaaattgaag caagggtatc cctattcaat 3 60 

actcgtcttc gtcggaactg gaaaagaagc gactatgtga caactgataa ccctgaattc 42 0 

tatttgaaga aggatctaca gaatggttat tgttctccaa attcatttct tcctcggagt 48 0 

atgaaaataa actaa 495 

SeqID 69 

atgaacgaaa tcaaatgccc tcattgtgga acagcttttg ccatcaacga gtctgaatac 6 0 

catcaattac tagaacaaat tcgtggagat gcttttgaca aagaagtaag tgaacggttg 12 0 

gaaaaagaac gtctaatatt aggggagcaa gcaaaaaatc aattacagga agttgttgta 18 0 

gaaaaagaca aggagatagc taaacttcag tacaaagtca aacaatttct tatagaaaaa 24 0 

gacaatcttc tcaaagacaa tgagtaccaa ctcgctgagc aattaaatca aaaagacatg 300 

atgcttcgcg accttgaaaa ccaaatcgat agactacgtt tagagcatga aaatagcttg 3 60 

caagaggcgc taacaaaagt cgaacgagaa agagatgcaa tacaaaatca gttgcacatt 42 0 

caagaaaaag aaaaagattt agctttagct tcagtaaaaa gtgattatga agtacaacta 48 0 

aaggcagcca atgaacaagt agaattctat aaaaacttca aagctcaaca gtctactaaa 540 

gcagtaggag aaagtttaga acattatgct gaaacagaat ttaataaagt gcgacatttg 60 0 

gcctttccta atgcttattt tgagaaggac aatacattat caagtcgtgg ctcaaagggg 660 
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gactfctatct atcgagaaaa ggatgaaaat gaccttgagt ttttaagtat catgtttgaa 720 

atgaaaaatg agtctgatga tactatcaag aagcataaaa atgaagattt tttcaaagaa 780 

ttagataaag atcgtcgtga aaaatcttgc gaatacgcag ttttagtaac tatgcttgaa 84 0 

gcagacaatg actattataa tactggaatt gttgatgtta gtcacaaata ccctaaaatg 90 0 

tacgttatac gtccacaatt ttttatccaa ttaattggta ttctaagaaa tgcagcactc 960 

aataccttaa aatataaaca agagcttgct ttgatgaaag aacaaaatat tgacatcaca 1020 

cattttgaag aagatttaga tattttcaaa aatgcatttg ctaaaaatta taattctgca 1080 

agcaaaaatt tccagaaagc aatcgatgaa atagataaat ctattaaacg tatggaagct 114 0 

gttaaggctg ctttaacaac gtctgaaaat caactacgtc ttgcaaataa taaattagac 120 0 

gatgtttctg tcaagaaatt aacaagaaaa aatccaacaa tgaaagcaaa attcgatgct 1260 

ctaaaagact aa 1272 

SeqID 70 

atgaatcatt ttgaactatt taagcttaaa aaagctggac taacaaatct taatatcaac 6 0 

aatattatca actatctcaa aaagaatagt ttaacttctc tatctgttcg caatatggcc 12 0 

gtagtatcaa aatgtaaaaa tcctactttc tttatagaaa attafcaaaca gctagacctt 18 0 

aaaaaacttc gacaagaatt caaaaaattt ccagttctat cgattttgga ttctaactafc 24 0 

cctttagagt taaaagaaat atataatcca ccagttctac ttttttatca gggtaatatt 3 00 

gaacttctat ctaaacctaa attagctgta gtgggggcaa gacaggcatc tcagataggt 36 0 

tgtcagtctg ttaaaaagat tatcaaggag actaacaatc aatttgttat cgtaagtggt 42 0 

ttagcgcgtg gcattgatac agcagcacat gttagtgctt taaaaaatgg cggcagtagt 48 0 

atagctgtta tcgggagtgg tttagatgtt tattatccaa cggagaataa gaaacttcaa 540 

gaatatatgt catataatca tctcgtatta tcagaatatt ttaccggaga acaacccttg 6 00 

aaatttcatfc tccccgaacg taaccgtatt attgcagggc tgtgtcaagg tattgtggtt 660 

gccgaagcta agatgagatc tggaagttta attacctgtg aaagagcatt agaagaggga 720 

cgagaagttt ttgccattcc cggaaatatt atcgatggca aatcagatgg atgccaccat 780 

cttatccaag aaggagctaa atgcattatt tcaggaaaag atatcctttc tgaatatcag 840 

tag ~ 843 

SeqID 71 

atgactgaac gaacattcga agatattgaa cttgacttaa agttattcca aataaagctt 60 

gataatgctg agaatagtaa aaggctctta caaaaattga aaaacgacgt catggagtta 12 0 

caaatagagt tactagaatc attaaaactc ggtgatgcct atttaacaga atcagaagaa 180 

ttagaagaga ataatgactt tattctaacg gtaaatagtg aaacactaag tttatcctat 240 

gacaatagga taaacttagt ttctaaagag attatggatt atgaaaatgc attagataag 3 00 

ttgtattatg aaaaacagag tttgatgcaa aaaagtaatg aaagaaaagg aggttaa 357 

SeqID 72 

ttgttcaata aaataggttt tagaacttgg aaatcaggaa agctttggct ttatatggga 6 0 

gtgctaggat caactattat tttaggatca agtcctgtat ctgctatgga tagtgttgga 12 0 

aatcaaagtc agggcaatgt tttagagcgt cgtcaacgtg atgcagaaaa cagaagccaa 18 0 

ggcaatgttc tagagcgtcg tcaacgcgat gcagaaaaca gaagccaagg taatgttcta 24 0 

gagcgtcgtc aacgtgatgc agaaaacaga agccaaggta atgttctaga gcgtcgtcaa 3 00 

cgtgatgcag aaaacagaag ccaaggtaat gttctagagc gtcgtcaacg cgatgttgag 3 60 

aataagagcc aaggcaatgt tttagagcgt cgtcaacgtg atgcggaaaa caagagccaa 42 0 

ggcaatgttt tagagcgtcg tcaacgtgat gcagaaaaca gaagccaagg caatgtttta 48 0 

gagcgtcgtc aacgcgatgt tgagaataag agccaaggca atgttttaga gcgtcgtcaa 54 0 

cgtgatgcag aaaacagaag ccaaggtaat gttctagagc gtcgtcaacg cgatgttgag 60 0 

aataagagcc aaggtaatgt tctagagcgt cgtcaacgcg atgttgagaa taagagccaa 66 0 

ggcaatgttt tagagcgtcg tcaacgtgat gcagaaaaca gaagccaagg taatgttcta 72 0 

gagcgtcgtc aacgcgatgt tgagaataag agccaaggta atgttctaga gcgtcgtcaa 78 0 

cgcgatgttg agaataagag ccaaggcaat gttttagagc gtcgtcaacg tgatgcagaa 84 0 

aacagaagcc aaggcaatgt tctagagcgt cgtcaacgcg atgcagaaaa cagaagccaa 90 0 

ggtaatgttc tagagcgtcg tcaacgtgat gcggaaaaca agagccaagt aggtcaactt 96 0 

atagggaaaa atccacttct ttcaaagtca attatatcta gagaaaataa tcactctagt 102 0 

caaggtgact ctaacaaaca gtcattctct aaaaaagtat ctcaggttac taatgtagct 108 0 

aatagaccga tgttaactaa taattctaga acaatttcag tgataaataa attacctaaa 114 0 

acaggtgatg atcaaaatgt catttttaaa cttgtaggtt ttggtttaat tttgttaaca 12 0 0 

agtcgctgcg gtttgagacg caatgaaaat taa 123 3 

SeqID 73 

atgaaaagac taacttatta ttttaagggg tatatcaaag aaactatctt tggacccctt 60 

ttcaaattat tagaagcctc ttttgaactt ttggtaccaa tcgttattgc aaaaatgatt 120 

gacgagacca ttccacgagg ggatagaagt agtttactgt tgcaaattgg attgattttc 18 0 

tttttggctg cggtgggtgt tgtagtagcg ataactgctc aatattattc ttcaaaagcc 240 
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gcagttggtt 

aaaaaagaca 

tttcaaatcc 

gtttttggtg 

gtaatggttg 

taccttaaaa 

ggtgtacgag 

atcaattatc 

cctctaacat 

aatatagcta 

caaattcttg 

atcagtgcta 

cttgagccaa 

aattcttctg 

gggataattg 

tataaagtgc 

tcaaattggc 

attcgttcta 

ttagaaatag 

gtagagagtt 

gctttagttc 

tatttaacag 

attattgtat 

aaggggaaac 

aagtccattt 



atacgagaca 

gagatgaatt 

aaactggttt 

ccataattat 

tcacgttatt 

tcagaacttc 

ttatccgtgc 

attatacgaa 

ttctagttgt 

atcatctttt 

ttgaattgtt 

agcgaattat 

aatattcgaa 

aaaaagcttt 

gtggaactgg 

aagaagggga 

gtaccctggt 

acctttcttt 

cacaagctag 

ttggtagaaa 

aagataagat 

aagctcgttt 

cacagagaat 

aagttggttt 

accattcaca 



attgacagag 

gggaactgct 

aaatcaattt 

ggctttttcc 

tatcattgtt 

aaccgattat 

ctttaatcaa 

tttacaatta 

taatatcact 

atcacaagga 

aaaaatgaca 

agccgttttt 

taaggcttta 

atctgatatt 

ctctgggaaa 

tattgatata 

aagagttgtt 

gggacttggt 

tgattttgta 

tttctctggt 

accatttttg 

atttaaagcc 

taatagtata 

tgataatcat 

aaattttaag 
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gatctctacc 

agtttgatta 

ttacgtttat 

attagcccct 

tttgtaatgt 

ttggtaaagt 

gtagatagag 

aaagcaggta 

ttggttgtga 

atgttggttg 

atgttggtga 

gaaagaccgt 

gaagtacaag 

actttttcta 

tcaaccttgg 

tatcatcagg 

cctcaaaatg 

aaagttagtg 

aaagaaaaag 

ggtcaaaggc 

attttagacg 

ataactaaac 

caaaatgcag 

caatctttat 

gaggaggagt 



aaaaagtcat 

ctcgtttaac 

ttttgagggc 

cattgactat 

ctcgactatt 

tgactaggca 

agagtgaagc 

ggctttctag 

ttatttggcg 

ctttaattaa 

catcactgaa 

ctgaaataat 

aaatggcatt 

tgaatgtagg 

ttaatttact 

gaaaaagtcc 

ctcagttgtt 

aggaaaaa c t 

atggtcaact 

aaaggttgac 

atgcaacatc 

atttcaacca 

atagaatctt 

tagctcataa 

aa 



gtcactgggg 

tgctgataca 

tccgattatt 

ttggttcttg 

aaatcctatt 

acaacttcaa 

atttaatgat 

tttagtaacg 

tggtaattta 

ttacttattg 

ccaaagttat 

tgatgacaaa 

ttcttatcca 

agaaacttta 

gcttcatatt 

agatacaatt 

taaaggaact 

ttggactgct 

tgatgcccct 

gattgcaaga 

tgcattggat 

aactaatctt 

actccttgat 

caaagtttat 



300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1722 



SeqID 74 

atgaaattta 

gctatgtatg 

gctgagcaac 

ttacaattac 

tttcagtctg 

ggagctttaa 

gcaaatttct 

ataccagttg 

actttatcac 

ttgaagtcaa 

ggcttagttg 

tattttggag 

gtttctatgg 

ttagacagta 

ggcttattta 

tctgggatac 

ttgttaccta 

atgttagttt 

tttgagcata 

tttaatggga 

ggtattgtta 

aaaaaattaa 

SeqID 75 

atggctaata 

gctatagata 

ggacatctct 

cgttggtggt 

gagctaacag 

tttgaatcgg 

tatgaacatc 

gatgagacca 

ggtgattatt 

ggcggtagcc 

aaacaaatca 

aaacgtctcg 

gagagtttac 

cgctatacgg 



atgagcaaag 
caggatccat 
taacttatct 
aattaagtaa 
tagctccttt 
ttgcttcagg 
ttcctccaat 
cgatgggaaa 
tggtaacgat 
tttcaatcct 
atgcttctgt 
ctcctagatt 
tagaatcaac 
agagacttcg 
atacctttcc 
gtacgcgtaa 
aatttggcgc 
tatttggtat 
atgagcataa 
caaatctctt 
tctcaaccct 
tatga 



taactcgcaa 
tcttgtacct 
tattgcgaca 
gcactttgga 
atcaattatt 
aatatatgtc 
tgtaacagga 
tatgggtgat 
tggtgttgtt 
tattggactc 
ggtagcagaa 
tgaatttact 
aggtgtttac 
taatggttac 
atataccggt 
gccgatatat 
aatggctcaa 
ggtagcactt 
ctttatcatt 
cattagtctt 
gacagctgtt 



gcagcacttc 
atcatgattg 
gatattttta 
gtgggtcttc 
ggtgcacaac 
gttttagttg 
tcagttatta 
aatgcaaaag 
ttattaatta 
atatcaggta 
gcaccacttg 
tctattttaa 
cttgcgcttt 
cgttcagaag 
ttttctcaaa 
tttacagctc 
atgattccaa 
caagggatga 
gcagccgtat 
cctaatactt 
gttttaaata 



taggcttgca 
ctagtgctct 
tgtgtgggat 
cagttgtatt 
aaggttccgg 
ctggcatttt 
ctacaattgg 
aaccgagctt 
atattttcgc 
caattcttgc 
tacatattcc 
tgatgtgtat 
cagatattac 
gattggcagt 
atgtgggact 
tcttcttagt 
gtccagttct 
aaatgcttaa 
caattgcagc 
tacaaatgtt 
tcattttgaa 



acacctgtta 
tggttataat 
tgccacctta 
agggtgtgcc 
ctatatgttt 
ttctaaagtt 
tttgacatta 
acaatcttta 
aaaaggcttt 
agcatttatg 
gaagccattt 
tattgcaaca 
aaacgataaa 
attacttggc 
ggtacagata 
tatacttggc 
tggtggtgct 
tcaggttgat 
aggagtaggt 
tttgacaaat 
tggattgcct 



catacgattt 
ttgtggccgt 
attgggatga 
caagaagcat 
gaaagaaacc 
ttacggtaac 
acaattttga 
ttgattttgc 
ttgaacctgt 
ttcaggggat 
tgaagcattc 
tgttttcgga 
agaatgtccg 
ctgatttatt 



aatttcacag 
tgcgagctca 
acatgatagt 
ggggaccaat 
tagctttcga 
accagtcgtt 
cttagggcga 
tttggcatca 
tattatcttt 
cgttgaaaat 
agacggtcta 
agctcctatt 
cttagtcgct 
gacagatggc 



cgtattgaag 
ctaggactta 
tttcatattt 
accattgatt 
gaaactgtca 
agagaaccct 
cagtatctta 
ggtagtatga 
aaaagttttg 
aaagtccagc 
gcaggttttc 
gaccttctct 
atggatgggg 
cagtattctc 



cacaacgaca 
acttgaaaca 
acccccaaac 
tagtccaagt 
actttttaga 
ttaagcacta 
aagaagaacg 
gctctgcgac 
cggaagacgg 
atcctgaacg 
atttggacgt 
cttattatga 
tcaaaaaagg 
aaaccatgcc 



gaaactcatc 
aggttcaggt 
taacaccttt 
catccaagaa 
aacagggcaa 
cctcgctcct 
agggctatct 
attgaaaaag 
ccgaatgatc 
tggccgtctc 
tggaacacct 
attacataag 
ggfcgattagt 
cagagagtcg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1275 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
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attagaggag cgatagatgc cattaaccaa acgactagaa tccttaaaaa caatcccaat 9 00 

atgattacca ttgccgttga taatgacgag gcaggccgaa actttattaa ggagttacag 960 

gaagatggca ttccaattaa cgtagacctc ccaccacgga aagaacacca gagtaaaatg 102 0 

gattggaata attatctgaa acaaaagaaa ggattactaa aaatgccaca aacagaaggc 108 0 

acacaaaaag ccccagaaca agttctggag catgaaaaaa tggataggag ccaaattagt 1140 

tcgggctctt tagaggacga ccctcaaggc agtgccaaac ctgtatccaa aagggatacc 12 0 0 

tttgagcaag ctgttaccag ccacccgact ttctcctatc ctttactaca gtttagcaca 12 60 

gaagaagcct ttgtgtcaaa cgttagagat ggctaccaca tcgcaagtga ggaggatatt 13 2 0 

cggaatctca attactatgc ccctagtctc caacaaacgg ctaactggta tcgggataat 13 8 0 

ctggcagacc gtcaagtgac ttatatgtta aaaggggata aggagataaa ggcgcttcag 144 0 

gtcagttttg ccaaggataa gtttgcccac ttaaccggta ttcgtccgat tggaaaaggg 15 00 

ttgtcagctg aaaaactgtt agatgatttt gcgaaaggtc gaggttccta tcctaatctg 1560 

accttatcaa atggttttaa tgacaagata caagttttgc caatgattca agaactatct 1620 

caatccaagt cttttgtctt tacagattta gaggaagttc aaaaaatgag aaacttaaag 168 0 

gccagtcatg cgattcaatc caataatcgg agtttagtcg tcgctttaaa aactattgat 1740 

gatgtcacct ttccttcgtc cctcctaaga ggaaaaaaga atctaaatga tgatctgatt 18 00 

caaaaggcga aagaaaatga agttctaggt gtactcagtg agaaagatgg gaacatcacg 18 60 

gtgctgtccg ttaacgataa gtacatccaa gatggtgggc aagccttaaa agatatgatt 1920 

aaaaatggcg aacttgaacc actccaaatg gagacgatac agcgacatgt tcctcatgag 19 80 

aatgcctatc ctaaggattc tgacggtgat ggcttaaccg atgatgaaga aatcgctcta 2040 

ggcaccaatc cctttagttc tgatagtgat ggtgatggga ctccagataa tgtcgaaaag 2100 

gcaaatggaa cagaccccac caatgcttct gataatgagg tgacaaggca acaagaagcc 2160 

aataaacgtg acttcacttt gtcagaaatg attaaagcca aaaataccgc agcgctgaat 2220 

caacacttac aagacggcat caaacagtat tttgatagtg atacctataa gcaatacctg 22 80 

gaagggatgg cccacttcaa taactactcg ccacgaaata ttcaattgat tatgtcacaa 23 40 

tttccagaag catcgatggt tgcgtctttc caagagfcgga ggaagcgaaa tggttcggtt 2400 

aaaaaaggtg agaaggccat ctatatccaa gcccctgttt ctgttatgaa aaaagatgag 2460 

aatgggaaac ctatccttaa tcctgaaacg ggagaaaagg agaccatcac ttactttaaa 2520 

cctgtccctg tctttgacat taaacaggtc tctccccaag aaggaaaaga actcaatctc 2580 

cctaaagcca tgggaactat tccagaacaa ttggataaag aatactacca aaatgtctat 2 640 

cgtagcttaa gagatatttc tcagaataac aataaggttc ccattcgatt tagagaactt 2700 

ggacaagaag atgggttcta tagtccacag acaaatgaaa ttgttattaa gaaagggatg 2760 

tcttatgaac gaaccttgtc aacgctgatt catgaaatgg ctcattcgga attacacaac 282 0 

aaacaaagtt tgacggaacg ctttgatggt aaactaacaa gaagtaccaa agaacttcaa 2 880 

gcggaatcca ttgcttatgt tgtatccagt catctaggtt ttgataccag tcaagagtca 2 940 

tttccttatt tagcgtcttg gtcaaaagaa aaagacgggc ttgccaactt aacagcacaa 3 000 

ttggaaatcg tgcaagaaga agccaaaaac tfcaatggaac ggattgatca acagttaagt 3 06 0 

caatatcaaa cagtaacact gaataaagaa acccaacaat tgactaaaca agaaatgaaa 312 0 

aagcagactc atccctttta tcaaagttta gcagccgcta aaacatcaag agcacaagtc 318 0 

accactcagg agaaggaagc aagcgtaaaa aaagacaatc gacctacgat gccgtag 3 237 

SeqID 76 

atgaattcac aagaaacaaa aggtcacgga ttttttagaa aatcaaaagc atatggttta 60 

gtatgtggga ttgcactagc aggtgcatfct acattagcta ctagtcaagt gtctgctgat 12 0 

caagttacaa ctcaagcaac aactcaaaca gtaacgcaaa atcaagcaga aacagtaaca 18 0 

tcaactcaac ttgataaagc agtagataca gctaaaaaag cagctgtagc tgttacaacc 24 0 

acaacagcag ttaatcatgc gacaactact gatgcacaag ctgatttagc taatcaaaca 3 00 

caagctgtta aagatgttac tgcaaaagca caagctaata cacaagctat taaagatgct 360 

actgctgaaa atgcaaaaat tgatgctgaa aacaaagcag aggcagagcg tgttgcaaaa 42 0 

gctaataaag caggtcaagc agaagtagat gctcgtaata aagcaggtca agcagccgtt 48 0 

gatgcacgta ataaagcaaa acagcaagcg caagacgatc aaaaagcaaa aattgatgct 54 0 

gaaaacaaag cagagtctca acgtgtaagt cagttaaatg cacaaaataa agcaaaaatt 60 0 

gacgcagaaa ataaagatgc gcaagctaaa gcagatgcga ctaatgcaca attacaaaaa 66 0 

gactatcaaa caaagttagc aaacattaaa tctgttgaag cttataatgc aggtgtacgt 72 0 

caacgtaata aagatgcaca agctaaagca gatgcgacta acgcacagtt acaaaaagac 78 0 

tatcaagcta aattagcact ttataatcaa gctctaaaag ctaaagcaga agcagataaa 84 0 

cagtctatta ataatgttgc ttttgacatt aaagcccaag ctaaaggtgt tgataacgct 900 

gaatatggaa actcaatcat gactgcaaaa actaaacctg acggaagttt cgagtttaac 96 0 

cacgatatga tcgatggtgt gaagacaatc ggctatggta agcttacagg taaagttaat 102 0 

catcattatg ttgctaacaa ggatggctct gtgacagcat ttgttgattc tgtcactctt 108 0 

tacaagtacg agtatcgtaa tgttgctcaa aatgctgctg ttaaccaaaa tattgtattt 114 0 

agagttttaa caaaagatgg tcgtcctatt tttgaaaaag ctcataatgg taacaaaact 120 0 

tttgcagaaa ctttaaacaa aactttacaa ctcaatctta aatatgagct taaaccacat 126 0 

gcttccagcg gtaacgtcga agtctttaag attcatgatg actgggtaca tgacacacat 1320 

gggtctgctt tagtgtctta tgttaataat aatgatgctg ttcctaatgt ggtcatccca 138 0 
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gaacagccaa 

gaaaagccag 

tttattccgc 

tttactccag 

aaagtagcgg 

gaaaacgggc 

gctaaacaaa 

ggttttgcat 

aaagcatcta 

gatactttag 

gagttttaca 

ggtctagata 

atcaaaaatg 

aatgatttga 

attaacaatg 

gtaccagcga 

acacacgatc 

ggcactgtaa 

gaaacaggtc 

agtgcctttg 

aataccgcag 

actcctgaga 

ccatccacag 

ctcttaggcc 



ctccaccaaa 

ttgagcctaa 

gagaatacaa 

ctcaaccaaa 

tgcatccagt 

aatctattaa 

actttagtca 

ttgfcagatca 

atggtgatga 

accaaaaact 

tgtggactgc 

ttacttataa 

gtgttgcaca 

caactccaga 

gtactgtcaa 

accgtggtta 

tttacctaaa 

ttccaaaagg 

gctatgagtt 

gggcagatga 

acttctttgt 

aaccaaaacc 

gagagcaagg 

ttgtagggtt 



accagaaaaa 

attggtaacg 

accagtccca 

agtgaaacct 

acagatacca 

cggtaaatct 

atacaaaggt 

accaaatgaa 

tgtatcaagc 

tcaatctctt 

aaaagatcca 

cttatcattt 

gattgatttt 

agtccataaa 

actcggtgac 

cgatcttttt 

agataaagta 

gacaaatcta 

agcctttaag 

ctttattgta 

caatggcaat 

agttatgccg 

ggtatctgtc 

taaaaagcgt 



32/109 

gttacacctg 

cctgttttaa 

agtacccctg 

catgtgtctg 

aaggccacac 

gtattaccaa 

attaaagctt 

gcgttagctg 

ttgttagaaa 

attaaagagg 

caagcttttt 

aaagttaaaa 

gggaatggat 

gacgtgttag 

gaagtcactt 

gaatacaaat 

gtcgctaaag 

gttcaatata 

gcagacttcc 

gttaaacgta 

aaggtaaaaa 

caaaaagtaa 

ctaacagtac 

caacagtaa 



aagcagaaaa 

aaacttatac 

agacgtttac 

tacctgaaaa 

caactaagaa 

atgcaacatt 

ctgcagaagc 

aattgactgt 

tgcgtcatgt 

caggaattag 

ataaagctta 

aagagtttac 

atacaggtaa 

acaaagaaga 

acaagcttga 

ttgtagatca 

tagcaattac 

ctgagactgt 

tcgcacaagt 

tcaaagcggg 

ctgaaactgt 

ctcctaaagc 

ttggtgccgc 



accagtacct 

tccagtcaaa 

ccctgagaaa 

aattaactat 

agttctagat 

agactatgtt 

tatcgcaaaa 

taagtcfcatc 

tttatcaaaa 

tecagttggt 

tgttcaaaaa 

taaaggtcaa 

tattgtagtc 

cggcaagtct 

aggatgggtt 

cttacaacac 

acttaaagat 

ctataataag 

ttcacgttct 

tgatgtttac 

ggtaacacat 

accagcttta 

cttactctca 



SeqID 77 

atgaatcaga 

aatgtatacg 

gaaaccgcaa 

gatcaagcaa 

aaaaatcaag 

ttagtgcaag 

aggttgtcgc 

aatcaagtta 

acagctgatg 

actatagtta 

gttaaaactg 

caaaaagcaa 

acacagcttt 

caaagtcatt 

tttgcttcaa 

cggacaataa 

aaacgtttcg 

gttgtgactc 

gttgttgctg 

attgtattga 

aacaatggtt 

actgctggtg 

actaagcagt 

caagctaaag 

ttgattaaag 

tcagtcaaat 

tctaaattaa 

aaaacaaaac 

gcaaatcaag 

aatctacgca 

gataacttag 

aaaactaaag 

gttttatcag 

ggtcacgaag 

aaacacaaaa 

tcagttgcta 

cttctgggta 

gttattaatt 



taaaaattat 

cagaagatat 

aaacgcctca 

caagtgatgt 

cacaacaaaa 

aatctaccaa 

agacagaagc 

atgcacaaca 

ttaaacaagc 

atagtgattt 

cgacagatgc 

aagcacaaca 

cacaagctaa 

attacaatca 

ctggatgtgt 

cgccgagaga 

gtggcacaag 

atttggctag 

ctgtacaaaa 

gaggtagttc 

tttacagtgt 

taggtgtacc 

cacaagcttt 

caacaggtct 

cacagtcaaa 

tagctaatca 

aagatgctca 

ttaatcaatt 

attacaagca 

atgcacaagc 

caaataagat 

cagtagatgc 

ctaaagcaag 

cagtagcagt 

tagaaatgaa 

aagcaagcgt 

tgttattaac 

aa 



cacaggactt 

cacaccgaca 

agttacagaa 

taatgcacaa 

acttgttaac 

tcaaaatcaa 

taatcaaaaa 

aacagttgtt 

tcaagcatca 

aaataaagca 

acttacaaaa 

aatagttgat 

tgctgaggtc 

acgtgataat 

cccatcagca 

gatagctgat 

tgggaaaggt 

caaaaatgct 

taataaattc 

taatggcaat 

tgaccgcatt 

gtttttcgca 

agcaagccaa 

taatgcagta 

tttaaaagat 

agataatgtt 

aaagcaatta 

aaaacaagtg 

agctaaagat 

taatttgatt 

tgctaagtta 

gcaatcacag 

tttaaaatta 

agttgacgaa 

atctcttgtt 

gttgccaagt 

gttctctggt 



acagtcgcaa 

gcaccagtta 

agtcaagtta 

aaaaatgtag 

gcaactacaa 

gtacaacaaa 

attactcaaa 

gttaacaatg 

gtcgatacag 

cagtcaaacg 

gcacaagcga 

tcagcaaaac 

aatcatcaca 

gcttgggctg 

ttagcaatgg 

tacttataca 

attatttctg 

ataactgaag 

agtccctggg 

acgtatgttt 

tggaatgaac 

attatgacta 

caagttgctc 

actatgcaga 

gctcaaaagc 

aaaaaacaag 

gcagcagcac 

ttagcagaag 

aatctaacac 

aaagcacagt 

caaagagaag 

tatcaaaaag 

gcggaagaga 

acaggtaaaa 

gcaactaaga 

actggtgatg 

tttttaggta 



cactatccgc 

atgaaccaca 

acagcgcaaa 

ttaataatgc 

cgttaaatga 

cagttgattc 

ctgaacaagt 

agcatgatgt 

ctaaagatgc 

tcacaactaa 

ctgataaaac 

aaaacttatc 

agtttaaaac 

gggtatatgg 

tttactctga 

acaacacaga 

ctactaaagc 

cattaaaagc 

ggcctcaata 

atgatccgta 

agtcacgaga 

aaaatatggc 

aaaaacaatt 

caccgattgc 

gattagcaga 

ctgacttaac 

aagctaaatt 

caagtcaaca 

aaaaaactgc 

ctgatgtagc 

tagcttattt 

ttttatcagc 

aagctcgact 

ttactagcta 

caactgatgt 

ttaaacaagt 

tacgtaaaca 



agtggtaggg 

agtatcaagc 

agttactgcc 

tcaaaatcaa 

aacacaaaaa 

cgcaaagcaa 

caaagctcaa 

tgcaactaag 

tttgactaat 

gacagctgat 

acttactaat 

tgctaaagat 

ggctctaggg 

agggcataca 

tttatcaaat 

tgaattcaat 

gtttggttat 

aggtcatcat 

cagtcacgaa 

taaccgtgat 

tagtattgat 

taatgcttta 

aaatgatgca 

acaagctaat 

agcacaagca 

aaaagcagag 

gacaacaagc 

agtagcccaa 

ttatctaaca 

acaagctaaa 

acaagagtta 

ttataagtca 

tgataaaaag 

tattacttct 

taaacaagta 

atcagttgct 

aagtaaaaaa 



1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2799 



60 
12 0 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2232 
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SeqID 78 

atgatatcac 

gttactaaat 

ttaaaatgct 

acaagtagag 

gatttgttgg 

tttgaagatt 

tgcattttac 

gcaggaacaa 

caattttctc 

gaacccggtt 

agagttgaaa 

gtgtcatggg 

gtcagtcaaa 

SeqID 7 9 

atgatttatt 

accaatgtca 

gcaaatcaac 

cgtgaaatta 

gctcttgcaa 

gtcctacata 

ccaaaccatg 

atggtatctc 

ggtcagcttc 

aaaatggaac 

tttcacggtc 

cttcatggtg 

gctggaatgt 

attagtcagc 

aacggtcaag 

ttgttattga 

gcaggaacag 

ctaaatgaat 

ttagctaaaa 



gtaaagttgc 
tagtttcaga 
ttggagaaaa 
aatcggttga 
ttaataatgc 
ggatgactat 
cgaaaatggt 
ttccttatcc 
tcaacttgcg 
tatgtgaagg 
aactttatga 
ttgctagcca 
cttatggacc 



gcttgtgacg 
tggttatagc 
gttttcagag 
caaagcttta 
aggtttagca 
gattaatact 
agaagttaat 
aggagctaat 
tgccgattta 
gacagaattt 
aggtgcgcat 
accagaacac 
tcaacctgtt 



33/109 

ggggcatctg 

gttattggtt 
ggttactttt 
gaaagtctgc 
cttggattgg 
aacgttgtag 
agagggttaa 
gtctatggcg 
gefcggaacta 
tcaactgtac 
gcaattcaag 
attaatatta 
tatcgtgatt 



ctggttttgg 
gtgctcgtcg 
atccacttca 
ctaaaaattt 
ataaatcata 
gattaattta 
ttatcaattt 
catcaaaagc 
aaatcagagt 
gtttcaaagg 
cagaggatat 
atcgcataga 
aa 



tgcagctatt 
aatggataaa 
aatggatatt 
gcaatcaatt 
tgaagctgat 
cttgacacga 
gggatcaact 
ttttgtaaaa 
tactaaccta 
agatcataaa 
tgctaatacc 
aataatgcca 



tagacaafcgc 
tgacaagtaa 
fctttacgtga 
ttttcacttc 
atcagctaaa 
ctatgactta 
gacaaattac 
tcafcgtttgc 
tcaggaacca 
ttgatcctca 
caaaaggtgt 
gagaccaaga 
ctcaagctct 
tgagaacgac 
actgcttacc 
cacagttaga 
tcgaacctag 
caatacgtat 
ccttaagaaa 



tgctactacc 
ctatggtaat 
atgtcgacaa 

tgggggaact 

aggtaaacat 
cttatcagag 
tgcaaaagac 
taataatgaa 
ccaagctgtt 
ttctttagga 
tgggatactt 
ggaaaaaagg 
tactgatgct 
ctttttagat 
tcatgtactt 
tttagctgga 
tcatgtctta 
tagtttttca 
aattatagga 



gctctaaccc 
ccatctagta 
attattgctg 
gagagcaaca 
attattacct 
cgatttggtt 
gttcaagaag 
accggggact 
tttcacgttg 
attgactttt 
tactgtgcfcc 
cgtgcttcaa 
acgactaaca 
gctatttcag 
aatataggtt 
ttcgcagttt 
acaagcttgt 
gaactaaata 
gattaa 



catctgttat 
tacatacctt 
aatatctaaa 
atacagctat 
ctgaaattga 
ttgatattac 
ctttacgaga 
ttttaccaat 
atgccgttca 
tagctgcttc 
cccatcactt 
ctgaaaatat 
cccttaaaaa 
accttgactt 
ttcctagaca 
caacaggttc 
atggagccaa 
cccaagaaga 



tgagaaaatg 
tgggcgtcaa 
tgttaafctca 
caaagggtat 
acatcattca 
ttacttaaaa 
tgatactatt 
tcaagagatt 
agtctttagc 
tgcccataaa 
tgatagtcta 
aattggtatt 
ttggactcac 
ctatcttaat 
gaataatggc 
tgcatgtact 
ctcaccacgt 
aattcttgaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
762 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1116 



SeqID 8 0 

atgtctagga 

atgagtcttt 

gtagaaaacc 

agcattttac 

gaaattctag 

gttcaataca 

ggcaagacag 

aacaaatttt 

catgaagttc 

tcttctggaa 

ggttatcgtg 

tttttaaact 

gtacgacaaa 

atttactacc 

tctgacatca 

gaatcaacca 

acattaactg 

aatcttatcc 

gctgaacaag 

gtaatttctt 

cttgaaaatc 

ccacaagcag 

aactatagct 



aaacatttaa 
attacactga 
caaatcctca 
cgcatfcacat 
attatattgc 
gcaacatcga 
aagaaatagc 
ggattgacgt 
gatgggggac 
aagttttaga 
ttggagatta 
ctcgtttaga 
accttcaaat 
aaatcgggaa 
ttacagaacg 
tttcgattaa 
gttatcaatt 
caactacaaa 
aatatactgt 
atgatgcaag 
agcctaacat 
cttgttatcc 
atttagaaaa 



acatatacta 
aaaagcccac 
cattattgta 
cgaagtcaat 
tcgaaattta 
gtttaaggaa 
aattgataac 
tccagtaact 
tccagtcgct 
tgaatacaca 
tatcacagac 
taaagaaggt 
tgataaattg 
tatccgtcca 
ctactatgtt 
aatggttgat 
ggcaactgtc 
atcaggagaa 
ttacctttca 
acctgttgat 
ctatactgaa 
aaacaaacaa 
atag 



tcgattggag 
gctattgctg 
aatgttacag 
gtaaagatgg 
aactctagtg 
agtgcttatc 
gacggtgtta 
tgtactgtta 
atatcacacc 
aatctacaca 
tatgcacttt 
tacaaacttc 
attttcaatt 
ctattaagtc 
tctaaaaatg 
gccaaaactg 
tcccatgtct 
agatatttta 
gaaacacctt 
tgggattatc 
gaagattcaa 
tttgcttgcg 



tttgcacgct 
gtcctagtga 
gtactgatca 
gacaaacttt 
ttggaggaga 
tgaagcgtca 
ctgfcacctaa 
ctcctatcgt 
gtatttactt 
ctgctgattc 
ctaagtctgc 
aacatcgtat 
atgactttaa 
gctcatcagc 
ctaaaagttt 
aacaaccgct 
ataacagact 
ttcaaaatat 
attctaaaga 
actcaggcgc 
ctgaattttt 
aaaatactga 



cgtactatcg 
ccgccaatac 
aaacggaaat 
aagtaaagaa 
aagtaaaaac 
attagatgat 
agacggtcca 
aacagaaaca 
tgttgaagaa 
ggaacttaac 
ttacgaagct 
tagcacgaat 
tgaggaaaat 
tgaagtagaa 
agcacgtaca 
atttaaccac 
ctttgaagaa 
gaaaaaaaca 
gaacgctccc 
ttcaggatca 
gggfcaataaa 
ctctaaatac 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1344 



SeqID 81 

atgaatcctt taataattgg aatgaatgat aaacaagcag aagcggtaca aacgacagac 

ggaccgcttt tgattatggc aggagctggc tctggaaaaa cacgtgttct gactcatcgt 

attgcttatt taatagatga aaaatatgtt aatccttgga atattttagc gattactttt 



60 
120 
180 
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actaataaag cagcgcgtga aatgcgtgaa cgtgctatcg ccttgaatcc agctacccaa 240 

gatactttga ttgcaacttt tcatagtatg tgtgttcgta ttttacgtcg tgaagctgat 3 00 

tatattggat ataatcgtaa ctttacaatc gttgatccag gtgagcagcg tactttaatg 360 

aagcgaatca ttaagcaatt aaatttggac acaaaaaagt ggaatgaacg ttcaatttta 420 

ggtacaatct ctaatgctaa aaatgacctt cttgatgaaa ttgcatatga gaagcaagcc 48 0 

ggagatatgt atacacaggt aattgctaaa tgctataaag cttatcaaga ggaattacgt 540 

agaagtgagg ccatggattt cgatgactfcg attatgatga cacttcgatt atttgatcag 60 0 

aataaagatg ttttggccta ttaccaacag aggtatcaat atatccatgt agacgagtat 660 

caagatacta accacgctca ataccaatta gttaagttat tagcttcgcg tttcaaaaat 72 0 

atttgtgttg ttggtgatgc cgatcaatcc atttacggat ggcgtggagc tgatatgcaa 78 0 

aatattcttg attttgaaaa ggactatccg caagccaaag ttgtattatt agaagaaaat 84 0 

tatcgatcga ctaagaaaat acttcaagct gctaataatg tgattaatca taataaaaat 900 

cgccgtccca aaaaattatg gactcaaaat gatgaaggtg agcaaattgt atatcataga 96 0 

gctaacaatg agcaagaaga agccgttttt gtagcatcaa ctattgataa catcgttcga 1020 

gaacaaggaa aaaatttcaa agattttgcc gttctttatc gtacgaatgc gcaatctcgt 1080 

actattgagg aagcactttt aaaatccaat attccatata caatggttgg tggaacaaaa 114 0 

ttctatagtc gtaaagaaat tcgagatgtt attgcttatc tcaatatcct tgcaaatact 1200 

tctgataata tttcttttga gcggattgta aatgaaccta aaagaggggt tgggccaggc 12 6 0 

actttagaaa aaatacggtc atttgcctat gaacagaaca tgtctcttct tgatgcctct 132 0 

tcaaatgtca tgatgtcgcc cttaaaaggg aaggctgctc aagctgtttg ggatttagct 138 0 

aatctgattt tgactctacg tagtaagcta gatagtttaa ctgtaacgga gattacagaa 144 0 

aacctactgg ataagacagg atatctagaa gcacttcaag ttcaaaatac attagaaagt 15 00 

caagcgcgta ttgaaaatat tgaagaattc ttgtcagtga ctaaaaattt tgacgataac 1560 

cctgagatta cagtagaggg tgaaactggt ttagatcgtc tatcgcgttt tttaaatgat 162 0 

ttagctttaa tagcagafcac tgatgatagc gccactgaaa cggctgaggt cactttaatg 16 8 0 

acattacatg ctgccaaggg tctagaattt ccagttgttt tcttaatcgg tatggaagaa 1740 

ggagtatttc ctttatcgcg tgcaatagag gatgctgatg aattagaaga agagcgtcgt 18 0 0 

ttagcttatg tcgggattac gagagcagaa caaatacttt tccttaccaa tgctaatacc 186 0 

cgtactttat ttggtaaaac tagttataat agaccaaccc gttttatacg tgagattgat 192 0 

gatgagctga tacaacatca agggttggct cgaccagtta attcttcatt tggtgtgaaa 198 0 

tattcaaaag aacagcctac acaatttggt caaggaatga gtcttcaaca agcgcttcag 204 0 

gcacgtaaaa gtaattcaca acctcaagta actgcccaac tccaggctct taacgcgaac 2100 

aa tagccatg agacatcatg ggagattggt gatgfctgcaa ctcataaaaa atggggggac 2160 

ggtacggtgc ttgaagtttc aggtagcgga aagactcaag agttaaaaat aaattttcca 222 0 

ggtattgggt taaagaaatt attagcgagt gtcgctccta ttagtaaaaa ggaaaactaa 22 8 0 

SeqID 82 

atgaaattat acgttcaatt aatggttatt ttaacctttt cattcgctgg tgaggttcta 6 0 

tctactatat tcaacttacc tgtaccaggt agtattattg gattaatatt actattccta 12 0 

gcgttaaaat acaaaatcat caggctcagg catattgatg ccgtggggaa ttttctatta 18 0 

gcaaatatga ccattttatt tttaccacca gcagttggac tgatggagca ctttcaagac 24 0 

ataaaaccct atctttttgg gatagctatc attatcttag gagcgctatt tctcaatatt 3 00 

ctaactattg gactggtatc ccaatggatc aaaaagagat atgaaggaga ttatccagaa 3 60 

attggaggta aaaatggcaa ctttaacgaa taa 3 93 

SeqID 83 

atgatttttg tcacagtggg gacacatgaa cagcagttca accgtattat taaagaagtt 60 

gatagattaa aagggacagg tgcfcattgat caagaagtgt tcattcaaac gggttactca 12 0 

gactttgaac ctcagaattg tcagtggtca aaatttctct catatgatga tatgaactct 18 0 

tacatgaaag aagctgagat tgttatcaca catggcggtc cagcgacgtt tatgtcagtt 24 0 

atttctttag ggaaattacc agtcgttgtt cctaggagaa agcagtttgg tgaacatatc 300 

aatgatcatc aaatacaatt ttttaaaaaa attgcccacc tgtatccctt ggcttggatt 3 60 

gaagatgtag atggacttgc ggaagtgttg aaaaggaata tagctacaga aaaatatcag 42 0 

ggaaataatg atatgttttg tcataaatta gaaaaaatta taggtgaaat atga 474 

SeqID 84 

atggtgatga aaatcataga gttaaaagaa gcaaccgtac aagtcagtaa tggtttagca 60 

gaaatgaaaa cgatattaga ccatgttaat ttgagtattt atgaacatga cttcattaca 12 0 

atattgggtg gaaatggagc tggaaaatca acgcttttta atgtaattgc aggtaccttg 18 0 

atgttaagcg gtggaaatat ttacatcatg ggacaagatg ttactaattt accagcagaa 24 0 

aaacgagcta aatatttatc acgggtattt caagatccga aaatgggaac agcacctagg 3 00 

atgacagttg ctgaaaattt attagttgct aaatttcgag gtgaaaagag accgttagtt 360 

cctagaaaga ttactaacta taccgaagaa tttcaaaaat tgattgctag aactggtaat 42 0 

ggacttgatc gccatttaga gacacctaca ggtttattat caggtggaca aagacaagca 48 0 

cttagtttat taatggcaac tttgaaaaaa ccaaatctgt tattattgga cgaacacaca 540 
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gcagcgcttg atcctagaac aagtgtttcg cttatgggct tgactgatga gtttatcaaa 600 

caagattcgt tgactgcctt aatgattaca catcatatgg aagatgctct caaatatgga 660 

aatcgtatat tagtaatgaa agatggcaag attgtcagag atttaaatca ggcacaaaaa 720 

aacaagatgg ctatagctga ctattatcaa ttatttgatt aa 762 

SeqXD 85 

atggaaatca aaaagaaaca tcgtattatg ctttattcag cccttatttt aggaacaata 60 

ttggttaaca atagttacca agctaaagct gaagagctta ccaaaactac ctcaacgtcc 12 0 

caaataagag atactcaaac taataatatt gaagttctcc agactgaaag taccactgtc 180 

aaagagacta gcaccacaac cacacaacaa gatctgtcta accccacagc ttcaaccgca 24 0 

actgcaacag ccactcatag cacaatgaaa caagtagtag ataatcaaac tcaaaataag 3 00 

gagctggtga aaaacggaga ttttaatcaa actaaccctg tatctggaag ctggtcacat 360 

acaagcgcta gggaatggtc tgcttggatt gataaagaaa atactgctga taaatcacct 42 0 

attatccaac gtaccgaaca aggccaagta agcctatcca gcgacaaagg ctttagaggt 48 0 

gctgtaacac aaaaagtgaa cattgatccc actaaaaaat atgaggtcaa gtttgatatt 54 0 

gaaacaagta acaaggctgg acaagctttc cttcgtatta tggagaaaaa agacaacaat 600 

acgcgacttt ggctttctga gatgaccagc ggtactacta acaaacatac cttaacaaag 660 

atatataacc caaagttaaa tgtctccgag gtgacacttg aactttatta tgaaaaagga 72 0 

actggttctg ctacttttga taatatatca atgaaagcaa aaggccctaa agactcagag 780 

catccacaac ccgtcacaac acaaattgaa gaaagcgtta atacggcttt aaacaaaaat 84 0 

tacgttttta ataaagctga ctaccaatac actctaacca atccgtctct cgggaaaatt 900 

gttggtggaa tattgtatcc aaacgctact ggttcaacaa ctgttaaaat atctgataaa 960 

tctggtaaaa taattaaaga agtaccgtta tcagttacag cttcaacaga agataagttt 102 0 

acaaaactcc tcgacaaatg gaacgacgtg actattggta atcatgttta cgatactaat 108 0 

gattcgaaca tgcaaaagat fcaatcagaaa ttagatgaaa ctaacgccaa aaacatcaaa 114 0 

actatcaaac tggattctaa tcacactttc ctttggaaag atttagataa tctcaataat 12 0 0 

tcagcacagt taaccgctac ttatcgtcgt ttggaagatt tagctaaaca aatcaccaat 126 0 

ccccactcta ctatttacaa aaatgaaaaa gctattcgta ctgtaaaaga gagtctggct 1320 

tggcttcatc aaaacttcta caatgttaat aaagatatag aaggctctgc caattggtgg 13 8 0 

gattttgaaa tcggtgtccc tcgctcaatt acagctaccc tagctctcat gaataactac 144 0 

ttcactgacg ctgaaataaa aacttatacc gacccaattg aacactttgt tcctgatgca 150 0 

ggatatttcc gtaaaacgct tgacaatcca tttaaagccc ttggtggtaa tctagtcgat 15 6 0 

a-tggggcgcg ttaaaatcat tgaaggttta cttcgtaaag acaatactat tatcgaaaaa 1620 

acttctcatt ctctaaaaaa tctttttact actgctacta aagctgaagg tttctatgct 168 0 

gacggttctt acatcgacca tacaaatgtt gcttatactg gcgcctatgg taatgttctg 174 0 

atagatggtt tgacacaatt gctgcctatc attcaagaaa ctgactataa aatctctaat 18 0 0 

caagaacttg atatggttta taaatggatt aatcaatcat ttttaccttt aattgtaaaa 186 0 

ggtgagttaa tggatatgag tcgtggacgc tcaattagta gagaggcagc ttcttcgcat 192 0 

gcggctgcag ttgaagttct cagaggtttc ctcagattgg ctaacatgtc taatgaagag 1980 

cgaaacttag acctcaaatc aactattaaa acgattatca cttcaaataa attctacaat 204 0 

gtcttcaata acctcaaatc gtattccgat attgccaaca tgaataagat gcttaatgac 210 0 

agtacagtcg ctactaaacc tttaaaaagt aatttatcaa cctttaatag catggaccgc 216 0 

ttagcttatt ataatgccga gaaagacttt ggtttcgcgc tttcattaca ttctaaacgt 2220 

accctcaact atgaaggaat gaatgatgaa aatacacgtg attggtatac cggagatggt 22 8 0 

atgttctatc tttataatag tgatcaatct cattatagta atcatttttg gccaaccgtc 234 0 

aatccttata aaatggctgg aacaactgaa aaagatgcta agcgtgaaga taccactaag 240 0 

gaattcatga gcaaacatag caaagacgct aaagaaaaaa ccggtcaagt tacaggaaca 24 60 

tctgactttg ttggttccgt caaacttaat gatcactttg ctcttgccgc tatggatttt 252 0 

actaactggg atcgcacctt aacagcacaa aaaggttggg ttatcttaaa tgataagatt 258 0 

gtctttttag gtagcaacat caagaatact aacggcattg gaaatgtttc tacaacaatt 264 0 

gatcaacgaa aagacgattc taaaacacct tatactacat acgtcaatgg aaaaactatt 270 0 

gatttaaaac aagcaagttc tcaacaattt acagatacaa aaagtgtctt tttagaatca 2760 

aaagaacctg gtcgcaatat tggttatatc ttctttaaaa atagcactat tgatattgaa 282 0 

cgcaaagagc aaacaggtac ttggaacagc attaatcgta cttctaaaaa tacctcaatc 288 0 

gttagcaatc cttttatcac tataagccaa aagcatgaca acaaaggtga tagctatggt 294 0 

tacatgatgg ttccaaacat tgatcgcaca agttttgata aattagccaa cagcaaagaa 3 000 

gtagaattac tagaaaatag ttcaaaacaa caagttatct atgataaaaa cagtcaaact 3 06 0 

tgggctgtta tcaaacacga taatcaagag agtctcatta acaatcaatt caaaatgaat 312 0 

aaagcgggac tttacctagt acaaaaagtt ggtaatgact atcaaaatgt ctattaccaa 3180 

cctcaaacca tgacaaaaac agaccaatta gctatctaa 3219 

SeqID 86 

atgcattctt tttctaatcc tggatatcct tatgataatg ccgtaactga agcatttttc 6 0 

aagtatttaa agcatagaca aatcaaccga aaacattatc aaaatatcaa acaggttcaa 12 0 

ttagactgct ttgaatacat tgagaatttt tataacaatt acaacccaca tacggctaat 18 0 
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ctaggactaa cccctaatca gaaagaagaa aattatttta acgcaataaa ataa 234 
SeqID 87 

gtgaagaaaa catatggtta tatcggctca gttgctgcca ttttactagc tactcatatt 60 

ggaagttacc aacttggtaa gcatcatatg ggtctagcaa caaaggacaa tcagattgcc 12 0 

tatattgatg acagcaaagg taaggcaaaa gcccctaaaa caaacaaaac gatggatcaa 18 0 

atcagtgctg aagaaggcat ctctgctgaa cagatcgtag tcaaaattac tgaccaaggc 24 0 

tatgtgacct cacatggtga ccattatcat ttttacaatg ggaaagttcc ttatgatgcg 3 00 

attattagtg aagagttgtt gatgacggat cctaattacc gttttaaaca atcagacgtt 3 60 

atcaatgaaa tcttagacgg ttacgttatt aaagtcaatg gcaactatta tgtttacctc 42 0 

a-agccaggta gtaagcgcaa aaacattcga accaaacaac aaattgetga gcaagtagcc 48 0 

aaaggaacta aagaagctaa agaaaaaggt ttagctcaag tggcccatct cagtaaagaa 540 

gaagttgcgg cagtcaatga agcaaaaaga caaggacgct atactacaga cgatggctat 60 0 

atttttagtc cgacagatat cattgatgat ttaggagatg cttatttagt acctcatggt 660 

aatcactatc attatattcc taaaaaggat ttgtctccaa gtgagctagc tgctgcacaa 72 0 

gcctactgga gtcaaaaaca aggtcgaggt gctagaccgt ctgattaccg cccgacacca 78 0 

gccccaggtc gtaggaaagc cccaa-ttcct gatgtgacgc ctaaccctgg acaaggtcat 840 

cagccagata acggtggcta tcatccagcg cctcctaggc caaatgatgc gtcacaaaac 900 

aaacaccaaa gagatgagtt taaaggaaaa acctttaagg aacttttaga tcaactacac 960 

cgtcttgatt tgaaataccg tcatgtggaa gaagatgggt tgatttttga accgactcaa 1020 

gtgatcaaat caaacgcttt tgggtatgtg gtgcctcatg gagatcatta tcatattatc 1080 

ccaagaagtc agttatcacc tcttgaaatg gaattagcag atcgatactt agctggccaa 114 0 

actgaggaca atgactcagg ttcagagcac tcaaaaccat cagataaaga agtgacacat 12 00 

acctttcttg gtcatcgcat caaagcttac ggaaaaggct tagatggtaa accatatgat 12 60 

acgagtgatg cttatgtttt tagtaaagaa tccattcatt cagtggataa atcaggagtt 1320 

acagctaaac acggagatca tttccactat ataggatttg gagaacttga acaatatgag 13 80 

ttggatgagg tcgctaactg ggtgaaagca aaaggtcaag ctgatgagct tgctgctgct 1440 

ttggatcagg aacaaggcaa agaaaaacca ctctttgaca ctaaaaaagt gagtcgcaaa 15 00 

gtaacaaaag atggtaaagt gggctatatg atgccaaaag atggtaagga ctatttctat 15 60 

gctcgtgatc aacttgattt gactcagatt gcctttgccg aacaagaact aatgcttaaa 1620 

gataagaagc attaccgtta tgacattgtt gacacaggta ttgagccacg acttgctgta 1680 

gatgtgtcaa gtctgccgat gcatgctggt aatgctactt acgatactgg aagttcgttt 1740 

gttatcccac atattgatca tatccatgtc gttccgtatt catggttgac gcgcgatcag 18 00 

attgcaacag tcaagtatgt gatgcaacac cccgaagttc gtccggatgt atggtctaag 18 60 

ccagggcatg aagagtcagg ttcggtcatt ccaaatgtta cgcctcttga taaacgtgct 192 0 

ggtatgccaa actggcaaat tatccattct gctgaagaag ttcaaaaagc cctagcagaa 19 8 0 

ggtcgttttg caacaccaga cggctatatt ttcgatccac gagatgtttt ggccaaagaa 2 040 

acttttgtat ggaaagatgg ctcctttagc atccoaagag cagatggcag ttcattgaga 2100 

accattaata aatctgatct atcccaagct gagtggcaac aagctcaaga gttattggca 2160 

aagaaaaata ctggtgatgc tactgatacg gataaaccca aagaaaagca acaggcagat 2220 

aagagcaatg aaaaccaaca gccaagtgaa gccagtaaag aagaaaaaga atcagatgac 228 0 

tttatagaca gtttaccaga ctatggtcta gatagagcaa ccctagaaga tcatatcaat 2340 

caattagcac aaaaagctaa tatcgatcct aagtatctca ttttccaacc agaaggtgtc 24 00 

caattttata ataaaaatgg tgaattggta acttatgata tcaagacact tcaacaaata 2460 

aacccttaa 2469 

SeqID 8 8 

atgaaaaaag gtttttttct catggctatg gttgtgagtt tagtaatgat agcagggtgt 60 

gataagtcag caaaccccaa acagcctacg caaggcatgt cagttgtaac cagcttttac 120 

ccaatgtatg cgatgacaaa agaagtatct ggagacctca atgatgtgag gatgatccaa 18 0 

tcaggtgcag gcattcattc ctttgaaccg tctgtaaatg atgtggcagc tatttatgac 240 

gcggatttgt ttgtttacca ttcacatacc ttagaagctt gggcaaggga tctagaccct 3 00 

aatttaaaaa aatcaaaggt tgatgtgttt gaagcgtcaa aacctttgac actagataga 3 60 

gtcaaagggc tagaagatat ggaagtcaca caaggcattg acactgcgac actttatgac 420 

ccacatacct ggacggatcc cgttttagct ggtgaggaag ctgttaatat cgctaaagag 480 

ctaggacgtt tggatcctaa acacaaagac agttacacta aaaaggctaa ggctttcaaa 540 

aaagaagcag agcaactaac tgaagaatac actcaaaaat ttaaaaaggt gcgctcaaaa 60 0 

acattcgtga cgcaacacac ggcattttct tatctggcta aacgattcgg cttgaaacaa 66 0 

cttggtatct cgggtatttc tccagagcaa gagccctctc ctcgccaatt gaaagaaatt 72 0 

caagactttg tcaaagaata caacgtcaag actatttttg cagaagacaa cgtcaatccc 780 

aaaattgctc atgctattgc gaaatcaaca ggagctaaag taaagacatt aagtccactt 84 0 

gaagctgctc caagcggaaa caagacatat ctagaaaatc ttagagcaaa tttggaagtg 90 0 

ctctatcaac agttgaagta a 921 



SeqID 8 9 
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ttgcgtaaaa aacagaaact accatttgat aaacttgcca ttgcgcttat gtctacgagc 60 

atcttgctca atgcacaatc agacatcaaa gcaaatactg tgacagaaga cactcctgct 12 0 

accgaacaaa ccgtagaaac tccacaacca acagcggttt ctgaggaagc accatcatca 18 0 

aaggaaacta aaaccccaca aactcctagt gatgcaggag aaacagtagc agatgacgct 24 0 

aatgatctag cccctcaagc tcctgctaaa actgctgata caccagcaac ctcaaaagcg 3 00 

actattaggg atttgaacga cccttctcag gtcaaaaccc tgcaggaaaa agcaggcaag 3 60 

ggagctggga ctgttgttgc agtgattgat gctggttttg ataaaaatca tgaagcgtgg 42 0 

cgcttaacag acaaagccaa agcacgttac caatcaaaag aagatcttga aaaagctaaa 48 0 

aaagagcacg gtattaccta tggcgagtgg gtcaatgata aggttgctta ctaccatgac 54 0 

tatagtaaag atggtaaaac cgctgtcgat caagagcacg gcacacacgt gtcaggaatc 60 0 

ttgtcaggaa atgctccatc tgaaacgaaa gaaccttacc gcctagaagg tgcgatgcct 66 0 

gaggctcaat tgcttttgat gcgtgtcgaa attgtaaatg gactagcaga ctatgctcgt 720 

aactacgctc aagctatcag agatgctatc aacttgggag ctaaggtgat taatatgagc 780 

tttggtaatg ctgcactagc ttacgccaac cttccagacg aaaccaaaaa agcctttgac 84 0 

tatgccaaat caaaaggtgt tagcattgtg acctcagctg gtaatgatag tagctttggg 9 00 

ggcaagaccc gtctacctct agcagatcat cctgattatg gggtggttgg gacacctgca 9 60 

gcggcagact caacattgac agttgcttct tacagcccag ataaacagct cactgaaact 102 0 

gctacggtca aaacagccga tcagcaagat aaagaaatgc ctgttctttc aacaaaccgt 1080 

tttgagccaa acaaggctta cgactatgct tatgctaatc gagggacgaa agaggatgat 1140 

tttaaggatg tcaaaggtaa gattgccctt attgaacgtg gcgatattga tttcaaagat 12 0 0 

aagattgcaa aagctaaaaa agctggtgct gtaggagtct tgatctatga caatcaggac 1260 

aagggcttcc cgattgaatt gccaaatgtt gatcagatgc ctgcggcctt tatcagtcga 13 2 0 

aaagatggtc tcttattaaa agacaatccc caaaaaacca tcaccttcaa tgcgacacct 13 80 

aaggtattgc caacagcaag tggcaccaaa ctaagccgct tctcaagctg gggtctgaca 1440 

gctgacggta atattaagcc agatattgca gcacccggcc aagatatttt gtcatcagtg 15 00 

gctaacaaca agtatgccaa actttctgga actagtatgt ctgcgccatt agtagcgggt 15 60 

atcatgggac tgttgcaaaa gcagtatgag acacagtatc ctgatatgac accatcagag 1620 

cgtcttgatt tagctaaaaa agtattgatg agctcagcaa ctgccttata tgatgaagat 1680 

gaaaaagctt atfctttctcc tcgccaacaa ggagcaggag cagtcgatgc taaaaaagct 1740 

tcagcagcaa cgatgtatgt gacagataag gataatacct caagcaaggt tcacctgaac 18 00 

aatgtttctg ataaatttga agtaacagta acagttcaca acaaatctga taaacctcaa 18 60 

gagttgtatt accaagcaac tgttcaaaca gataaagtag atggaaaaca ctttgccttg 1920 

gctcctaaag cattgtatga gacatcatgg caaaaaatca caattccagc caatagcagc 198 0 

aaacaagtca ccgttccaat cgatgctagt cgatttagca aggacttgct tgcccaaatg 2 04 0 

aaaaatggct atttcttaga aggttttgtt cgtttcaaac aagatcctaa aaaagaagag 2100 

cttatgagca ttccatatat tggtttccga ggtgattttg gcaatctgtc agccttagaa 2160 

aaaccaatct atgatagcaa agacggtagc agctactatc atgaagcaaa tagtgatgcc 2220 

aaagaccaat tagatggtga tggattacag ttttacgctc tgaaaaataa ctttacagca 2280 

cttaccacag agtctaaccc gtggacgatt attaaagctg tcaaagaagg ggttgaaaac 2340 

atagaggata tcgaatcttc agagatcaca gaaaccattt ttgcaggtac ttttgcaaaa 24 00 

caagacgatg atagccacta ctatatccac cgtcacgcta atggcaagcc atatgctgcg 24 60 

atctctccaa atggggacgg taacagagat tatgtccaat tccaaggtac tttcttgcgt 2520 

aatgctaaaa accttgtggc tgaagtcttg gacaaagaag gaaatgttgt ttggacaagt 25 80 

gaggtaaccg agcaagttgt taaaaactac aacaatgact tggcaagcac acttggttca 2 640 

acccgttttg aaaaaacgcg ttgggacggt aaagataaag acggcaaagt tgttgctaac 2700 

ggaacctaca cctatcgtgt ccgctacact ccgattagct caggtgcaaa agaacaacac 2760 

actgattttg atgtgattgt agacaatacg acacctgaag tcgcaacatc ggcaacattc 2 820 

tcaacagaag atcgtcgttt gacacttgca tctaaaccaa aaaccagcca accggtttac 28 80 

cgtgagcgta ttgcttatac ttacatggat gaggatctgc caacaacaga gtatatttct 2 94 0 

ccaaatgaag atggtacctt tactcttcct gaagaggctg aaacaatgga aggcgctact 3 000 

gttccattga aaatgtcaga ctttacttat gttgttgaag atatggctgg taacatcact 3 060 

tatacaccag tgactaagct attggagggc cactctaata agccagaaca agacggttca 3120 

gatcaagcac cagacaaaaa accagaagct aaaccagaac aagacggttc aggtcaaaca 3180 

ccagataaaa aaacagaaac taaaccagaa aaagatagtt caggtcaaac accaggtaaa 3240 

actcctcaaa aaggtcaacc ttctcgtact ctagagaaac gatattctaa gcgtgcttta 3 3 00 

gctacaaaag catcaacaag agatcagtta ccaacgacta atgacaagga tacaaatcgt 33 60 

ttacatctcc ttaagttagt tatgaccact ttcttcttgg gattagtagc tcatatattt 3420 

aaaacaaaac gccaaaaaga aactaaaaaa tag 3453 

SeqID 9 0 

tttagtgtaa cctattcaca gtctgaacgt acggttgttt tctcttttgg agaaataaca 60 

tttagtagga gtcgctggac aaatggcttt gaaactagaa taccagtaga tgagtggtta 120 

ggtcttgaaa aatataagag atattcaata gaattcttat atcatgttgc aaaattggct 18 0 

acaatgatgc cttatcgtca agtttgcaaa gtaatagata gcactttgca aacaatcata 240 

acaaaagact gtgttttaaa agcagtaaaa tttgtagaaa aattgttaaa agaaaaagaa 3 00 
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cgcfcatcgtt 
gagggtgatg 
ttaacacatt 
caggacaagc 
tatatttata 
ggtaaaggct 
catgagcatt 
tatccaattg 
cttgaattag 
tttcagaagt 
agaaatcttt 
agaatgaagc 
attatacttg 
tacagtgagt 
ttagataaat 
aaaacaaaat 



tttatttgga 

gagtcatgat 

ttgttattca 

acgaaatatt 

ataactatga 

atactagtag 

tttgggatat 

aattaaccga 

tttttgatac 

ttaagaaaaa 

caaatcgtgg 

gacgtggcat 

aaagagctaa 

ataaagaagg 

tttctaagcc 

ag 



agagccaccc 

taaaagcaca 

tacaggctca 

acagcttaat 

agtagatgac 

agtttttaag 

ctatcatgtt 

ttttgcttta 

tgttgaatca 

agtattaaat 

ta.ttggtatc 

gtattggtca 

cggtttacga 

ttcatttagt 

ccttctaaaa 



38/109 

gaacgtaaaa 
gattctagag 
aaaaaagttt 
tatgataaag 
gatactattt 
gaattaggaa 
aaagaaaagt 
gatgcggtaa 
ctgatttgtg 
aatttcaaat 
at ggaa t cac 
aagtggggaa 
gaattatttt 
gcagggcgac 
aatggcagaa 



aagtgaaaaa 

aggaaagaag 

ctactaaaag 

ctaaatataa 

taatcactaa 

aagcacttaa 

taagttcata 

aaaaatataa 

atgaacttga 

atataaaacc 

aacacagaaa 

tctccacaat 

tcggfctcttg 

tttttaaaaa 

aatggagtat 



actgtatgtt 

gtatttagat 

atatgaattg 

tcttttagat 

ctctgatatg 

ggtaaagaaa 

ccttagaaaa 

ttctgataag 

agatcaagaa 

agctcatctt 

gataacgtat 

ggcaaatatg 

gagaaaggta 

gacagatgaa 

aacaggaatc 



360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1212 



SeqID 91 

aaagcaagca 

tacagcattt 

gccgtggagt 

agcgagtcta 

ctttcacgtc 

aaagccacct 

gacaggcaat 

cgtgtaggca 

atcggcctgt 

atcccttacg 

aatcagttaa 

ggttgtcctt 

gtttatcaag 

cttttctgta 



gcggtgatta 
caaagatgtg 
cagtatctaa 
gatacggatc 
gtcggattcg 
tcaaaccaca 
ttaagcaaga 
atcgttgggc 
ctcttggttg 
ccctgaccaa 
ttgatgaaat 
atgacaatgc 
aaacctttca 
caacatttat 



tggcacaaaa 
tcgctggctg 
aacggagttt 
cagaaaaatc 
acgcattatg 
ttctagaggc 
aagaccacta 
ttatgtttgc 
gcacaagacc 
agtcaagatg 
attggaagcc 
cgtagctgaa 
atcgctggaa 
aaagtgttgc 



agggaaataa 
aatatgccac 
gaagaaacta 
aaaatatgct 
aagcgactca 
aagaatgaag 
caagccttag 
ctcatcattg 
gctgaactcg 
ttccattcag 
tttggaatca 
agtacgtatc 
gaactagctc 
cttttcaggt 



tcactgctaa 
gctcaagtta 
ttaaaagaat 
tgaataacga 
atttggtttc 
cccctattcc 
tcactgactt 
acctatacaa 
ttaagcaagc 
atcgtggcaa 
cacgttcgct 
gtgctttcaa 
ttaagactga 
ttttaccaat 



taaggataaa 
ttactatcaa 
ttttctcgat 
aggtatcaca 
tgtttatcag 
caaccactta 
aacctatgtt 
ccgtgaaatc 
catacaaagc 
agagtttgat 
tagtcaggct 
aattgaattt 
aaaggcaaca 
gctataa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
837 



SeqID 92 

atgaagacaa 

tctaaacagg 

ggtcagccta 

ccttttatgg 

tttgagaatc 

aaggaggtca 

SeqID 93 

ttgtccactt 

aacctttctg 

acaccatcta 

ggaaatgaac 

ttgaatcagg 

tctacaaaac 

atcttattgg 

ggttatccaa 

ctaagtcatg 

caggtcgctt 

acagaccgct 

caagagccag 

cattatccac 

gaagctgcgg 

attgaacaaa 

ttgtctcgtt 

tcgtttatgg 

accgataaag 

tcagttcctg 

tttcctgaag 

ctcgtagcta 

gaacaagaaa 

gagtacaatc 



gaaatcgtaa 
caattcattg 
ctggggagat 
tgaagtttga 
ttcaagcttg 
aataa 



aggtggttat 
cttgagtgta 
tatcgcttat 
aaatgaagtg 
tgaggttggg 



ttagcgaata 
gaacttgaac 
aaggcttggt 
acactaccag 
ttcaatgttt 



ctgcaaatga 
cgcaaattag 
tctctcaaaa 
catatatggt 
atttcaaggc 



gtacatcgac 
gtttgaagag 
agggcttccg 
aatggtgcaa 
agacaatctc 



ttgattcagt 

aaaaatttgc 

ttcagctagg 

tgttgggtta 

ccgaaatgat 

tagaggagct 

atgccgttgg 

aagcgagtgt 

ataaggtaga 

ttgctttcct 

ttcgtcatac 

aaatggtgga 

ttggttcaag 

acatggataa 

atcccgttct 

cggaaattgt 

atgaggggac 

ttgttacatt 

aaaccaaaga 

atttgactaa 

ttcgtcttgt 

ttcttgccaa 

cgaagttttc 



tacaggaagg 
cattttaagt 
ggagtttgat 
tttagaggca 
ggcagtcctt 
gggaattgat 
tcgttttcgc 
ttcccagcta 
atttttctta 
acatgaaagc 
aagcttaaac 
gatgtcagac 
ggtcacttat 
tctgattcga 
ctatttccaa 
agaaagagag 
agaggataac 
agatagtcag 
ggtgattgaa 
tttttacccc 
aaaagaatta 
atatgtgggt 
taaggaaaga 



accaatcact 
cactttgaca 
agggaaatgg 
aatggcagcc 
gagcttagcc 
ttgggttcat 
ttgaaaaatg 
gctcttgcga 
accagtcagc 
ttgaccagag 
tttcgagaat 
gtcagcccgc 
aaaggacagg 
atagagcttc 
aacttggcag 
gaacaacctg 
gaaaaggaac 
ccagcacaag 
gcagaacaag 
aaaacagcta 
gaaagtgcta 
tggggtggtc 
gaggagttga 



tggaagctta 
aacttagaaa 
aagctttttc 
cctatgagtt 
gtcaactggt 
ttcaaccaga 
cggacattgc 
cagaactcct 
ttcagttaga 
aagaagcaga 
ggcgagaact 
ttgttcggga 
agtttgagat 
aaaatgattt 
aaatcaggca 
aggaagaatg 
ctgagattgt 
taagtgagac 
caatagcagt 
gggataaagt 
atcaatcagc 
ttgccaatga 
aaactctcgt 



tgaagggctg 
tgagctgcag 
aatcactttg 
gaagcgagaa 
tgcaaaattt 
ccaagtcaac 
tttattaggc 
ccagatggga 
ggatatgcga 
gcaatttgaa 
tctagaaaag 
agtcttgcat 
cttgtctatt 
ttcatatctc 
agttcttcat 
gactctcttt 
ttttgaatct 
tatatccgaa 
tgattttagc 
agtggcgaat 
aacacctaat 
attctttgat 
ttctgataaa 



60 
120 
180 
240 
300 
315 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
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gagtattccg atatgaaaca atcatctcta acggcctact atacggatcc gcttttaatt 144 0 

cgtcagatgt ggtctaagtt agaacaagac ggctttagtg gtggaaaaat attagaccca 150 0 

tcaatgggaa cagggaatfct ctttgcggct atgccagctc atttgaggga aaagagtgag 1560 

ctgtgtgggg ttgagttaga tactatcaca ggagcgatag ctaagcagct tcattcaaat 162 0 

gtccatattg aagtaaaggg atttgagacg gttgctttta atgataatag ttttgatttg 168 0 

gttatttcaa atgttccctt tgccaatatc cgtatcgcgg ataatcagta cgataagccc 174 0 

tatatgattc atgactactt tgtcaagaaa tctcttgatt tggtacatga tggtgggcaa 1800 

gtagccatta tttcctccac aggaaccatg gacaaacgga cagaaaatat tctccaagat 1860 

attcgtgaga cgacggattt tcttggtggc gtgcgtttgc cagatacggc ttttaaggcc 192 0 

attgcaggta caagtgtgac aacggatatg cttttctttc agaaacatct gaacaaaggc 198 0 

tatcaagcag atgacattgc cttttcaggt tctgttcgtt atgacaagga cgagcgtatt 2040 

tggctcaatc cctactttga tggggagtat aatgcccagg ttcttggaag gtatgagatt 2100 

aaaaacttta acggtggaac actttcggtt aaggaaacga ctgataacct gattgcgagt 2160 

gttcgagaag ccttgcaaca tgtaaaagca ccaagggtta ttgataaaac agaagtcatg 2220 

attaactcgg atgtgattgc aagacaggtc atagatacca ccattccacc tgaaattaga 22 8 0 

gaaaaccttg aacaatatag ctatggttac aaaggttcta ccatttatta tcgtgataat 234 0 

aaggggattc gtgttgggac aaaaacggag gaaattagct attatgtgga tgatgaggga 24 0 0 

aactttcagg cttgggagtc caagcattca caaaaacaga ttgaccgttt taataactta 24 60 

gaagtcacag atagcacagc tcttgatgtt tatgtgactg aagaacctgc gaaacgtggg 252 0 

caatttaaag gatatttcaa aaaggcggtt ttttatgaag ctcctttgtc tgaaaaagaa 258 0 

gtggcacgga ttaaaggaat ggtagatatt cgcaatgcct atcaagaggt cafctgctatt 2640 

caacgcaatt atgattacga taagaatgag tttaaccgct tgttaggaaa cctcaatcgc 270 0 

acctatgata gctttgtcaa acgctatggc ttttfcgaaca gtccagttaa ccgaaattta 2760 

tttgatagtg atgataagta ttcgctttta gctagtctcg aagatgaaag tcttgaccca 282 0 

agtgggaaga cggtcattta tacgaagtca ctggcttttg aaaaagcctt ggtccgtcct 28 8 0 

gaaaaaatgg ttaaagaggt gtcaaccgct cttgatgcac tcaactccag tcttgcggat 294 0 

ggtcgtggag ttgactttga ttacatggct tccatctatc aaacagcatc taaggcagct 3 000 

ttgattgagg agctgggaga tcagattata ccagatccag aaagttattt aaaaggacag 3 060 

ctgacctatg tatctcgcca ggagttttta tcaggagata ttgtgacaaa attagaagtt 312 0 

atggacttac tgttaaagca agacaatcat gattttaact gggcgcatta tggtaatttg 3180 

ttagaaagtg ttcgtccagc acgagtgatg ttggcagata ttgattatcg aattggttca 324 0 

cgttggattc ctttagctgt ttatggaaaa tttgtccaag aagcctttat ggggaagaat 33 00 

tatgacttaa cggctacaga agtggaagag gtcctctctg tcagcccgafc tgatgggacg 33 60 

atggaatttc ggacacggtt tgccfcatacc tactcaacag cgacggatag aagcctgggt 342 0 

gtagcaggct cacgctatga tagtggtcgg aaaatctttg agaatctgct taactccaat 348 0 

cagccaacca tcacgaagca gattcaagag ggagataaga aaaagaatgt gacagatgtg 354 0 

gagaagacaa ccgttctgcg tgctaaagaa gcacaaatcc aagacctctfc tcaagatttt 3 6 00 

gtcgcaagct atcctgaagc ccaacagatg attgaggata cctataatag tctctataat 3 660 

cgtaccgtct ctaaagtfcta tgatggcagc cgtttggaga ttgatggttt ggctcaaaat 3 72 0 

afctfccgcttc gtcctcatca aaaaaatgcc attcagcgaa tcgtggagga aaaacgggct 3 78 0 

cttttagctc atgaagtggg ttcaggcaaa actttaacca ttcttggagc aggctttaaa 3 84 0 

ctgaaagagt taggtatggt acataaaccg ctttatgtgg taccttctag cttgaccgct 3 90 0 

cagtttggtc aagaaatcat gaagttcttc ccaacgaaga aagtctacgt gacgacaaag 39 60 

aaagattttg ccaaagctcg gcgcaagcag fcttgtctcac gtattattac aggggattac 402 0 

gatgccattg tcatcgggga ctcccagttt gaaaaaatcc ccatgagtca ggaaaaacag 40 8 0 

gtgacttata ttcaagataa gttggaacaa ttacgggaga taaagcaagg aagcgatagt 414 0 

gactacacag tcaaagaagc ggagcgttcg attaaaggtt tggaaaacca attggaagaa 42 00 

ctccagaaac tagatcgaga tacctttatt gaatttgaaa atcttggcat tgattttctc 42 60 

tttgtggacg aagcccatca tttcaaaaat attaggccga ttactggtct gggaaatgta 432 0 

9 c gggaatta ccaacacgac ttctaaaaag aacgtggata tggaaatgaa agtgagacag 43 8 0 

attcaggcag agtatggaga tagaaatgtc gtttttgcga caggaacgcc ggtgtctaat 444 0 

tccattagtg aactctatac tatgatgaac tatattcagc cagatgtcct agaacgctac 45 0 0 

caagtttcta actttgactc ctgggttgga gcttttggca atatcgaaaa ttcgatggag 4560 

ctagcgccaa caggagataa gtatcaaccc aagaaacggt ttaaaaagtt tgtgaatcta 4 62 0 

cctgaactca tgcggattta taaagaaacc gctgatattc aaacgtcaga tatgttggat 468 0 

ttacctgtac ctgaagccac agtcattgcg gtggaaagtg aactcacaga agctcagaaa 474 0 

aactaccttg aagaattggt ggaccgttca gatgcgatta aatcaggaag cgttgaccct 48 0 0 

agtgtcgata acatgttaaa ggtgataggc gaagccagaa agctagccat tgatatgcgt 48 60 

ttgattgacc ctgcctatac cttatctgac aatcagaaga ttatgcaagt agtggataac 492 0 

gtggaacgga tttaccgtga gggcaaagga gataaggcta cacagatgat tttctctgac 4980 

attggcacgc ctaaaagtaa agaagaagga tttgatgtct ataatgaatt gaaagccctc 5040 

ttagttgacc gaggaattcc aaaagaagaa attgcctttg tccatgacgc taatacggat 510 0 

gagaagaaaa attctttgtc acgaaaggtc aatagtgggg aagtaaggat tctcatggct 5160 

tcaactgaaa aaggaggaac aggattaaac gtgcaagcac gcatgaaagc tgttcaccat 522 0 

ttagacgttc cttggcgtcc ctcagacatt cagcaacgca atggacgttt gattagacaa 528 0 
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ggaaatcaac atcagaatgt agagatttat cattatatta ccaaaggttc atttgataac 53 40 

tatttgtggg ctacacaaga gaataagttg cgttatatta aacaaattat gacttcaaaa 540 0 

gatcctgttc ggtcagcaga agatattgac gagcaaacca tgaccgcttc agatttcaaa 5460 

gcgttagcta ctggcaatcc ctatctcaaa ctcaaaatgg agttagagaa tgaattgacc 5520 

gtcttagaaa atcagaaacg tgctttcaat cgctccaaag atgaatacag gcacactatt 558 0 

tcctattgcg agaaaaactt acctgttatg gaaaaacgat taaggcagta tgacagagat 564 0 

attgagaaat cacaagcaac gaagaatcaa gagtttatca tgcggtttga taaccagaca 570 0 

attgacaatc gcagcgaagc tggggattat ttacgaaaac tcatcaccta taatcgttcc 5760 

gaaaccaaag aagtcagaac cttagcgacc tttagaggct ttgagcttaa aatggctetca 5820 

cgaagtcctg gtgaaccctt gtctgacatg gtgtctctaa ctatttcagg ggataaccag 5 88 0 

tattcagttt cccttgattt aaaatcagac gtgggaacga ttcaacggat taacaacgcc 594 0 

attgaccata tcctagagga taaggaaaag acagaagaga tgacaaacaa cctcaaagat 60 00 

aagctagcag tcgcaagagt tgaagttgag aaagtctttg caaaggaaga ggaatatcag 606 0 

ttggtaaagg ccaagtatga tgtacttgcc ccattagtgg aaagagaagc agacttagaa 612 0 

gaaattgacg tagcactttc ccaatttagt agctctgacc catgcttgaa aaaagatcaa 618 0 

cagttagtcc ttgacattta a 62 01 

SeqlD 94 

atgaccacaa aatgcaatca tcattttctc atcaatcagg agaaaggcga aaagtatgtt 6 0 

tttcgtaaaa gtaagcaata tcgcacgctg tgttcggttg ctttaggaac agttgtgatg 12 0 

gcattcgtag ctttggccgg accgatggtg caggcggatg aagtagggag aactgttgcg 18 0 

acaagtgtac agacggaaac gaatcctgca acaaatttaa aagaaaatca gccaagtccg 240 

attgcggaac aaaaagatag tcttgcagca accggtcaat caactggaac ggtgacagtc 3 00 

actgttccac atgataaggt gacacaagct gtcgataagg caaagaccga aggaataaaa 3 60 

gcagtacaag acaagccgat ggacttaggc aatacagtat ccgcagctga aaccagccaa 420 

caactcaaaa aggcagaaga agatgccaca aaccaaacaa caactatttc taaaactgtt 480 

gaaatctaca agtctgataa agcaacttat gaagctgaaa agaagtgggt agagaagcgt 540 

aatgaagagt taactgctgc ttatgataag gcagaacaaa cagggactgg cttaaaccat 6 00 

tcggttgata cgactgtttc agaattgaag tcgcaagacc aaaacgctca tgtgaccgtg 660 

aacacacaaa cagtaaaatc aggagatggg acgagtgttt caggctatca ggagtacgtc 720 

aagtctgttg cggccattga taagaagaat aaagcgaact tagctgatta tcggactaag 78 0 

aaacaagccg cagacgctgt tgtggctaag aatcaactca ttcaaaaaga gaatgaagct 840 

gggcttgcta aggcaaaagc agaaaatgaa gcgattgaca gacggaataa agaagggcaa 900 

aaagctgttg atgaagcaaa taaggctggg caagccgcag tagagcaagc gaaccaagaa 960 

aaacaaaaac aggccgcaaa ccgtgctttt gaaattgcga caattaccaa acggaataaa 1020 

gaaagagaag aagtcgcaaa gaaagaaaat gcagcgattg atgcttataa tgcgaaagaa 1080 

tggatccgct ataagcggga tttagcaaac atctcaaaag gggaggaagg ctacatttca 114 0 

gaagcccttg cgcaggctct agatttaaat catggcgaac cgcaggtcaa acatggtgca 12 00 

ggtactcgaa atccagatcg aatcatttca aagggagatg ccatgttggg tggctattct 1260 

aacattcttg attcaacggg tttctttgtc tacaatcact ttaaaacagg tgaaacgctg 132 0 

aactttacct atcaaaatct gaagcatgca cgttttgatg ggaagaaaat tacagccata 13 8 0 

acttatgata ttaccaatct ggtttcacca actggaacca atgcggtgca gttagttgtt 144 0 

ccaaacgacc caacagaagg ctttattgct tatcgcaatg atggcgctgg aaattggcgg 1500 

acagataaga tggagttccg tgtcaaagct cggtatttct tagaagacgg ttcacaagtg 156 0 

acctttacta aagaaaaacc aggtgtcttt acccactcgt cactcaatca taatgatatt 162 0 

gggcttgaat atgttaaaga ctcatcaggt aaatttgttc ctattcatgg ttcaagtgtg 168 0 

caagtaacca atgagggctt agcccgttcg ttaggttcaa accgagcgag tgacttgaag 174 0 

ttgccagaag aatgggatac gacttctagt cgctatgctt ataaaggagc gattgtatca 18 00 

acagtcacat cggggaatat ttataccgta acctttggcc aaggagatat gccaacccaa 186 0 

gtgggaggga agacctattg gtttgcttta aatactttgc cagtcgcaaa aacagtgact 192 0 

ccctacaatc caaagactca tgtaagacca cagctggatc cagttcctga accgataaaa 198 0 

gttacaccag aaacttatac tcctaaaatc tttactcctg aaaaaccagt aacctttact 2 04 0 

ccaaaatcag tagaaaaagt gccccaacct agtttgacct taacaaaagt cacactacca 210 0 

acaaatctga agctagaacc attacccaaa gctccacaaa agccaaccgt tcattaccac 2160 

gattacctct taaccacaac acccgctatc gcaaaagaag tgatgaatgt tgacaaagtt 222 0 

aatcttcatg gtaaacaggt ggctaaggat tccactgtta tttatccctt gacagtagat 228 0 

gttttatctc caaatcgttc caagataacc agtcttatct ttgaagatta tctgcctgct 234 0 

ggttatgcgt ttgatatgac gaagacacaa gcggagaata gcgactatga cttaaccttt 24 0 0 

gataaaaata agaactttgt gaccttgaaa gcgaaagata gcttacttca aacgttgaat 24 6 0 

aa^agagttaa acaagtctta tcaactgtct gctccaaaac tttatggttc agttcaaaat 252 0 

gatggggcga cttattctaa tagctataaa ctccttatta acaaggacac cccaaacacc 258 0 

tatacggtta tttcaaacgt agttagaatt cggactccag gagacggtga aacgaccagc 264 0 

cgaatccggc ctaaaaagga caatgaaaat gcggacggtg tgttgataaa tgatacggtt 2700 

gttgctctta gtacaaccaa tcattaccgt ttgacctggg atttggatca gtacaagggg 276 0 

gatacttctt caaaagatac cattgcacga ggattccttt ttgtagatga ctacccagaa 282 0 



WO 2004/099242 PCT/EP2004/004856 

41/109 



2880 
2940 
3000 
3060 
3120 



3360 
3420 
3480 
3540 
3600 



3780 
3840 
3900 



gaagcacttg acttggtaga taaagggaca gttatcacca cccttgatgg taaagctgtt 
tcaggtattt ctgtatacag ctatacatcc cttgataaag caccaaaaga gctacaagat 
aaattagctc gtgcgaacat ttcgccaaaa ggagcgttcc aggtctttga accggacaat 
catcaagcgt tttacgatac ttacgttaag actgggcagt ctttagccct tctcacaaaa 
atqaaggtaa aagatagttt gtacggtcag actgtcagat acaaaaataa agcctaccaa 

gttgattttg gcaatggcta tgaaactaag gaagtcgtca atacagtcgt tcatccagag 3180 

cctaagaaac aaaatctcaa taaggacaag gtggacatca atggcaaagc gatgcttgtc 3240 

ggctcccaaa acttctatac gctttcttgg gatttagatc agtatcgtgg cctacaagcg 33 0 0 
gataagagtc agattgctca aggcttttac tttgtggacg attatcccga agatgttctt 
cttcctgaca caaaagccat tcaaattatg acgaaagacg ggaaagctgt taaaggaatg 
qagataaaaa cctatcatca actctcagat gctccaaagg aattgcaggc agctcttgcc 
aaacgtaata tcacgcccaa aggcgctttt caagtcttta tgccaaaaga tccacaagct 
ttttataaag cttatgtgac aactggtcaa aacttgacta ttgtgaatcc gatgacggtt 

cgtgaagcag tatacaactc agggaagtct tatgataatg tggcctatca agtggacttt 3 660 

ggacaggcct atgaaacgaa tatagtgacc aatcacgtgc caacagtaaa cccgcataag 3 72 0 
tccaatacca ataaagaagg tgtgtcaata gaaggtaaga cagtccttcc gaatacggtc 
aactactato aaattgttct agattatagt cagtacaaga atatgattgt gacagatgat 
gtcctcgtca aaggttttta tatggtggat gattacccag aagaagcact tactcctcat 

ccagatggga ctcaagtgat ggatcagaac gggaaatttg tcagaggcct gtctgttcga 3 9 60 

acttacgcaa gtttggcaga tgcaccgaaa gcagtacaag aagcaatgaa atctcgaaat 4020 

ttcattccta aaggagctat tcaggtcttt caggcagatg acccaaaaac tttctttgaa 4 080 

acttatgtga aaacaggaca aaaattggtt gtgacaacac cgatgacagt gaagaatgaa 4140 

cggattcaaa caggcggtca gtatgagaat acagcttatc agattgattt tggcatcgct 420 0 

tatgtaacgg aaacagttgt gaataatgta ccaaaattag aacctcaaaa agatgtggtc 426 0 

attgacttat ctcaaaaaga taagagttta aacggaaaag caatcgcttt ggatcaggtc 432 0 

tttaactacc gcttggtagg ttcacttatc ccacgcaatc gtgcgacagc cttggtggaa 43 8 0 

tacagtttta aagatgacta cgatgaaaag cacgatgaat ataagggtgt ttataaagcc 444 0 

tatactttgc gtgatgtaac cttaaaagat ggcacagtat taaagcaggg gacagaagtg 450 0 

acgaaataca ccttacaaag tgttgataag gcaaaaggaa cgattgcgat tcgttttgat 456 0 

acagcttttt tggaaaatat agctgatgaa tcagagtttc aggctgagct ttatcttcag 462 0 

atgaaacgca ttgcttcagg tgatgttgaa aatacagtta ttcatagtgt caatggttat 4680 

aatattcgct ctaacacggt taagaccaca accccacaac cagagtcacc gactcctgat 474 0 

aacccaccgt caccccaacc gccagttcca accacagaat ccccagttca agctagtgtc 48 0 0 

ctaccaagta caggggaaag tcaatccctt ttggcgctca ttggaggagg tctcctgctfc 48 6 0 

ggcctcgctt acggactttc taaacgaaaa aaggagaaaa actaa 49 05 

SeqID 95 

atggcaaata ataattttta taatcgtgac ccttttggaa atatggatga tatttttaat 6 0 

tctctaatgg gaaatatggg gggctataat agtgagaata aacgatactt aatcaatgga 12 0 

cgggaagtta ctcctgaaga attcagtcaa tatcgtcaaa ctggaaaact ccccggtcaa 18 0 

gagctaaata accaaaatac tcctacaaac caagtctctg cagatagtgt tttgactaaa 240 

ttaggaacta atttaaccga tcaagcacgt cagcatttgc tagatccagt tattggacga 3 00 

aacaaagaaa ttcaggaaac tgcagaaatt ctagctcgtc gtactaaaaa taatcctgtt 3 60 

ctagttggtg atgcaggtgt tggtaaaaca gctgttattg aaggtttggc acaagcaatt 420 

ataaatggtg atgtccctgc tgctatcaaa aataaagaaa tcatttcgat agatatctct 480 

agccttgaag ctggcaccca atataggggt agttttgaag aaaatattca aaatatcata 540 

aaagaagtta aggaaacagg aaatattatt ctcttctttg atgaaataca ccaaatatta 6 00 

ggagccggtt caactggtgg tgatagtggc tctaaggggc tcgcagatat acttaagcct 660 

gcactttcac gtggtgaatt aacggttata ggtgcaacta ctcaagatga atatcgtaac 720 

accattctca agaatgctgc cttagcacgc cgctttaacg aagttaaagt caatgcacct 780 

tctgcacaag atactttcaa tatccttatg gggataagaa acctctatga gcaacatcat 840 

aatgttgttt tacctgactc tgttttgaaa gcagctgtag atttatctat tcaatacata 900 

ccacaaagaa gtctccctga taaagcaatt gaccttattg atatgactgc tgcacaccta 960 

gctgcgcaac atcccgtaac tgaccttaaa tccttagaaa aagaaatcgc tgcgcaaaga 102 0 

gacaagcaag aaaaagctgt caatactgaa gattttgaag aagctcttaa ggtcaagaca 108 0 

cgcattgaag agttacaaaa ccagattgat aatcataccg aaggacaaaa agtcactgca 114 0 

actatcaatg atatcgctat gtcaattgag cgtctaactg gagttcctgt atcaaatatg 12 0 0 

ggtgctagtg atattgaacg cttaaaagaa cttggcaatc gtctaaaagg taaagtcatt 1260 

ggtcaaaatg atgctgttga ggcagttgca agagctattc gccgaaatcg tgctggtttc 132 0 

gatgatggta atcgtccgat tggtagcttt ctatttgttg gaccaactgg agttggtaaa 13 80 

actgagctag caaagcaatt agcttttgat atgttcggat caaaggatgc tatcgttaga 144 0 

cttgatatgt ctgaatataa tgatcgtact gcagtttcta agctaattgg agcgactgct 1500 

ggttacgttg gttatgacga caatagtaat acactaacag aacgtatccg tcgaaatcct 156 0 

tattctattg ttcttcttga tgaaatcgaa aaagctgacc cgcaagttat cacgcttctc 162 0 

cttcaagttc tagatgatgg tcgtttaact gatggtcaag gaaacactat taactttaaa 168 0 
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aatactgtta 
agcgataaag 
aatcgtttca 
gtagatttga 
gtagatgaaa 
cgtccattgc 
gatcatacag 
gaaagataa 

SeqID 9 6 

atgggaagat 

gggaccgagc 

ctgattgagg 

gatattgtga 

tctgaaagtc 

gagcaagfcfct 

tctggggatg 

tacgatattt 

gatgagggtg 

cttattgagc 

caagatggta 

gttagcccac 

ttcttgcaag 

gagatttatg 

cttgaaaata 

actagaccag 



tcattgcaac 
acttgaaaat 
atggtgttat 
tgcttgatga 
atgttaaatc 
gccgtgtcat 
atgttaaaca 



ctcaaatgca 
tatggaacga 
tgaattctct 
agttaaccaa 
acacttaatt 
cgagcaagaa 
cctaaaagct 



42/109 

ggttttggaa 
atttctccat 
cacctaagca 
acaattggca 
gacctgggtt 
attcgagatc 
aatttgcaag 



acgaagcatt 
atttccgtcc 
aagatgactt 
aaaaaggaat 
atgacgaagc 
gcatcacaga 
atggccaaat 



tacaggtgac 
agaatttcta 
aaacgaaatt 
tgaccttgtg 
aatgggagta 
ctactatctc 
cgtcatttct 



ttaaagaact 
ttgagagtag 
atccagcagc 
cgacatcaac 
aggcagaaga 
ggaaaagttt 
ttggtcccta 
ataaggaagg 
ttgaccttct 
tacttgtcga 
agaccatctc 
aggtcgtggc 
caatagcaga 
atggggcaag 
caagtgactg 
ctgatattgc 



tttagagagt 

aggttgtgat 

tattcagacc 

ttatcaggcg 

cttgattcgc 

gacaaaagaa 

cgctgcctfct 

tctcaaaaat 

ggcacttgaa 

ggatttccct 

agatggtagt 

gctgggcatt 

gcagacggac 

ccaatcttgg 

gcaaaaactt 

tgacctctca 



9L3.a5Lcia.gc t c 
gtttctggta 
attcatgaag 
actttacaag 
ttgactgttc 
gaaaaatcag 
ttggcggacg 
ttccatcgcc 
accattccaa 
caggttgagg 
gctgttgcag 
aactgttcaa 
aagcctcttg 
caatctagcc 
ggcgcccaag 
gaacatttga 



tcatcctaca 
aactttggtc 
attatattcg 
ggctagctca 
agttggcaaa 
aaagaatcta 
gttctgagta 
atcgtattga 
atgctcagga 
cttatatgtc 
gcttggcgaa 
gtccatcgtt 
tgacctatcc 
gagatcattc 
ttgtcggagg 
cataa 



cggtgccttg 
agacaaatat 
tgctggcgct 
agttggtgtc 
agctgtgcgt 
tcctttgatt 
cacagggctc 
gcttctttta 
ggctgaggct 
attcacctct 
agctattgat 
agttgctgat 
aaactctgga 
acacacccta 
ttgttgccga 



1740 
1800 
1860 
1920 
1980 
2040 
2100 
2109 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
945 



SeqID 97 

atgaaaatag 

gatttggcta 

tcacttagta 

gatattttat 

agtatcgatc 

ttcgcaaggt 

gctaaattac 

gctaaatatg 

cttattagtc 

gatattatgg 

tcaacaaaac 

aatactagtc 

ttaaagggct 

caagaaaatt 

tcattgtatc 

caaattgcct 

gtagacggct 

aaaattacag 

ggaaatgcca 

catcaacgta 



gaattgataa 
ttgctcgcca 
ttacgccagt 
ctgatgagga 
agagtaaggc 
cattcgagat 
atgtagaaaa 
gcattaaatc 
aaaacccatc 
acttttggcg 
aatatctaga 
taagtgacta 
ttaataaaat 
ttgagcactc 
taggacttct 
tcttttcata 
accaaaataa 
taactgagta 
acttcaacac 
tctacaagaa 



gattggtttt 
agttgaccct 
tacagaagat 
caaagaaact 
ggcttctgtc 
gaaagaagca 
acatccagat 
tactggagaa 
cattctagag 
accaaattat 
tatgctaaaa 
tgctgctttt 
tctagataat 
tattacttat 
atcgttacta 
cggtagcgga 
acttcagagc 
tgagaaactc 
ctaccgtact 
aattaataat 



gcaaccagtc 
gaaaaattca 
atcgttacac 
attgatatgg 
tatgtacatc 
tgctatagcg 
tctaaagtat 
tccactcaag 
ctaaaagagg 
tctgatgttc 
actacttgga 
tgtttccata 
aacctagatg 
agtaagaaaa 
gaaaatagtc 
gctgttgctg 
gatcgtatgg 
ttctttgaaa 
ggcacttttt 
taa 



aatatgtcct 
gcaagggatt 
ttgcagcctc 
tgattgtcgc 
aacttttaga 
caactgctgc 
tagtgatagc 
gtgcaggcag 
accgtctagc 
cttatgttaa 
aagaatatca 
tcccttttcc 
aacaaaagaa 
ttggaaattg 
aaaatttaaa 
aaatttttac 
atcaactcaa 
aaacaatcct 
ctctagactc 



tgaaatgact 
attgttagat 
tgcagctaat 
tacagaatca 
gattcaacct 
tcttgattac 
ttcagatatt 
tatcgctatg 
gcaaacccgt 
tggcatgttt 
aaaacgtttt 
aaaattagct 
agctgaacta 
ctatactggc 
agcaggtgat 
cggtcagtta 
taaacgtcaa 
agacgaaaat 
tatttgcgaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1173 



SeqID 9 8 

atgaaaagtg 

caagaagtaa 

acaccttcat 

gacttggtta 

gctggtggtg 

aaagctttga 

gtaaagtccg 

gggctccttg 

tttgctttcg 

cttgcttacg 

aattcgattg 

agagtctaca 

cgtaagggag 

atttcaactt 

ggtgaccaat 



cttatatctt 
aatcttattt 
ctgttgagga 
ttcctttagg 
cttatcctac 
acatcccaca 
tagatatctg 
ctgatattgc 
taggtgatgc 
ataataatgt 
ccggtatgcc 
ctatggaaca 
attttagtca 
ttaaacgtaa 
tgcctgtaaa 



ttttaatcca 
aatagaacat 
agctgtcgcc 
gggagatggc 
tatcggacta 
agagaggaat 
taaagttaac 
tgctaatgtc 
ctaccgtata 
acggtcttta 
agccttctct 
tattcacttc 
agcaaaggag 
aaaatcagct 
agttgaagtc 



aaatctggta 
gattttcaag 
ttagcaaaaa 
actattaata 
gtacctgccg 
ctcgctcttg 
gacgattaca 
acttctgaaa 
ttaaagcgta 
cgcacgcggt 
ccagaagcaa 
ttcaaattat 
ataaaacatt 



aagatgaaca 
acgattatgt 
aagcatcaga 
aaatctgtgg 
ggaccgtcaa 
aaaaccttct 
tgattagcag 
tgaagcgtaa 
atcgatcata 
tactacttat 
caattgatga 
tgttgcatct 
ttcataccaa 



atccctaaag ttcgtattga 
atccctaaag ccttaaagtt 



agcattggca 
tcgtatcatt 
agaccatatc 
tggtgtctac 
taacttttca 
aaacggtcat 
tttaacactt 
attaggacct 
ttctattact 
taccatgacc 
tggacttttt 
aagacaattt 
taatctgact 
tggcgatcca 
tatcattcct 



60 
12 0 
180 
240 
300 
360 
420 
480 
54 0 
600 
660 
720 
780 
840 
900 
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aactcattac cataa 9 15 
SeqID 99 

atgacaacat ttactgctaa atttattgac caagagtggg aagtcccagt agaatcaggt 60 

cgctaccata tgattgttgg ggaattttgt ccttatgcac aacgtccgca gattgcacgc 12 0 

caactgcttg gtttggataa acatatttct atcagttttg tggatgatgt tccaagtgat 18 0 

ataggactta tttttagcca accagaacag gttactggtg ctaagtcgct tcgtgatatt 240 

tatcatttga ctgacccgac ctacaaaggt ccttatacta tcccaatttt aattgacaag 3 00 

actgataatc gtattgtctg taaggaatct gcagatatgc ttcgactttt tactacagat 3 60 

ttctctgatt tgcatcaaga ggatgctcct gtccttttca gccaagaaac tgcttcactt 420 

attgataatg atattaaaga cattaataat aacttccaaa gcttaatgta taagttagct 480 

tttcttgata aacaagcaga ttatgatact tatagtaagg aattcttcac ttttctagat 54 0 

caaaaagaac acttactagg acaacgtcct ttcttgctcg gagataatct aagtgaggtt 60 0 

gafcattcatt tctttacacc attagtccgc tgggatattg ctgggcgtga tcttctacta 66 0 

cttaatcaga aagcgctaga agactaccca aatatttttt cttgggctaa aactttatac 720 

aatgacttta atttaaaaac attaaccaat ccacaatcca taaaaaataa ctattactta 780 

ggcaaatttg gaagagctgt tcgtcatcat accatcgtac caacaggtcc aaatatggta 84 0 

aaatgggaaa aataa 855 

SeqID 100 

atgaaaaaga aaattatttt gaaaagtagt gtccttggtt tagtcgctgg gacttctatt 6 0 

atgttctcaa gcgcattcgc tgaccaagtc ggcgtccaag ttataggcgt caatgacttt 120 

catggtgcac ttgacaatac tggaacagca aatatgcctg acggaaaagt tactaatgct 18 0 

ggcactgctg ctcaattaga t get tat at a gatgatgetc aaaaagattt caaacaaact 240 

aaccctaatg gtgaaagcat tagagttcaa gctggtgata tggttggagc aagtccagct 3 00 

aactcagggc ttcttcaaga tgaaccaacc gttaaaacat ttaatgcaat gaatgttgag 3 60 

tatggcacat taggtaacca tgaatttgat gaaggtttgg cagaatacaa tegtategtt 420 

actggaaagg cccctgctcc agattctaat ataaataata ttacgaaatc atacccacac 48 0 

gaagctgcaa aacaagaaat tgtagtggca aacgttattg ataaagttaa caaacaaatc 54 0 

ccttacaatt ggaaacctta cgctattaaa aatattcctg taaataacaa aagtgtgaac 60 0 

gttggcttta teggaategt taccaaagac atcccaaacc ttgtcttacg taaaaattat 66 0 

gaacaatatg aatttttaga tgaagctgaa acaategtta aatacgccaa agaattacaa 72 0 

gctaaaaatg tcaaggctat tgtagtcctt gctcatgtac ctgcaacaag caaggatgat 78 0 

attgetgaag gtgaagcagc agaaatgatg aaaaaagtca atcaactctt ccctgaaaat 84 0 

agegtagata ttgtctttgc tggacacaat catcaatata caaatggtct tgttggtaaa 9 00 

actegcattg tacaageget ctctcaagga aaagcctatg ctgacgtacg tggtgtccta 9 60 

gatactgata cacaagattt cattgaaacc ccttcagcta aagtagttgc agttgctcct 102 0 

ggtaaaaaaa caggtagtgc cgatattcaa gccattgttg accaagctaa tactategtt 108 0 

aaacaagtaa cagaagctaa aattggtact gecgaggtaa gtggcatgat tacgegttet 114 0 

gttgatcaag ataatgttag tccagtaggc agcctcatca cagaggctca actagcaatt 12 0 0 

gctcgaaaaa getggecaga tatcgatttt gecatgacaa ataatggtgg cattegtget 1260 

gacttactca tcaaaccaga tggaacaatc acctggggag ctgcacaagc agttcaacct 13 2 0 

tttggtaata tcttacaagt cgtcgaaatt actggtagag atctttataa agcactcaac 13 8 0 

gaacaatacg accaaaaaca aaatttcttc cttcaaatag ctggtctgcg atacacttac 144 0 

acagataata aagagggegg agaagaaaca ccatttaaag ttgtaaaagc ttataaatca 15 0 0 

aatggtgaag aaatcaatcc tgatgcaaaa tacaaattag ttatcaatga ctttttattc 15 60 

ggtggtggtg atggctttgc aagcttcaga aatgecaaac ttctaggagc cattaatccc 162 0 

gatacagagg tatttatggc ctatatcact gatttagaaa aagctggtaa aaaagtgagc 168 0 

gttccaaata ataaacctaa aatctatgtc actatgaaga tggttaatga aactattaca 174 0 

caaaatgatg gtacacatag cattattaag aaactttatt tagatcgaca aggaaatatt 18 0 0 

gtagcacaag agattgtatc agacacttta aaccaaacaa aatcaaaatc tacaaaaatc 18 60 

aaccctgtaa ctacaattca caaaaaacaa ttacaccaat ttacagctat taaccctatg 192 0 

agaaattatg gcaaaccatc aaactccact actgtaaaat caaaacaatt accaaaaaca 19 8 0 

aactctgaat atggacaatc attccttatg tctgtctttg gtgttggact tataggaatt 204 0 

gctttaaata caaagaaaaa acatatgaaa taa 2073 

SeqID 101 

atggcatata tttggtctta tttgaaaagg taccccaatt ggttatggct tgatttacta 6 0 

gga.gcta.tgc ttfcttgtgac ggttatccta ggaatgecca cagccttagc gggtatgatt 120 

gataatggcg ttacaaaagg tgateggact ggagtttatc tgtggacgtt catcatgttt 18 0 

atatttgttg tactaggtat tattgggcgt attacgatgg cttacgcatc tagtegctta 24 0 

acgacaacaa tgattagaga tatgegtaat gatatgtatg ctaagcttca agaatactcc 3 00 

catcatgaat atgaacagat aggtgtatct tcactagtga caegtatgae aagegatact 3 60 

tttgttttga tgcaatttgc tgaaatgtct ttacgtttag gectagtaac tcctatggta 420 

atgattttta gcgtggttat gatactaatt acgagtccat ctttggcttg gcttgtagcg 48 0 
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gttgcgatgc ctcttttggt aggagtcgtt ttatatgtag ctataaaaac aaaaccttta 540 

tctgaaagac aacagactat gcttgataaa atcaatcaat atgttcgtga aaatttaaca 600 

gggttacgcg ttgttagagc ctttgcaaga gagaattttc aatcacaaaa atttcaagtc 66 0 

gctaaccaac gttacacaga tacttcaact ggtcttttta aattaacagg gctaacagaa 72 0 

ccacttttcg ttcaaattat tattgcaatg attgtggcta tcgtttggtt tgctttggat 78 0 

cccttacaaa gaggtgctat taaaataggg gatttagttg cttttatcga atatagcttc 84 0 

catgctctct tttcattttt gctatttgcc aatcttttta ctatgtatcc tcgtatggtg 90 0 

gtatcaagcc atcgtattag agaggtgatg gatatgccaa tctctatcaa tcctaatgcc 960 

gaaggtgtta cggatacgaa acttaaaggg catttagaat ttgataatgt aacattcgct 102 0 

tatccaggag aaacagagag tcccgttttg catgatattt cttttaaagc taagcctgga 1080 

gaaacaattg cttttattgg ttcaacaggt tcaggaaaat cttctcttgt taatttgatt 114 0 

ccacgttttt atgatgtgac acttggaaaa atcttagtag atggagttga tgtaagagat 12 00 

tataacctta aatcacttcg ccaaaagatt ggatttatcc cccaaaaagc tcttttattt 12 60 

acagggacaa taggagagaa tttaaaatat ggaaaagctg atgctactat tgatgatctfc 13 2 0 

agacaagcgg ttgatafcttc tcaagctaaa gagtttattg agagtcacca agaagccttt 13 8 0 

gaaacgcatt tagctgaagg tgggagcaat ctttctgggg gtcaaaaaca acggttatct 1440 

attgctaggg ctgttgttaa agatccagat ttatatattt ttgatgattc attttctgct 15 0 0 

ctcgattata agacagacgc tactttaaga gcgcgtctaa aagaagtaac cggtgattct 1560 

acagttttga tagttgctca aagggtgggt acgattatgg atgctgatca gattattgtc 162 0 

cttgatgaag gcgaaattgt cggtcgtggt acccacgctc aattaataga aaataatgct 168 0 

atttatcgtg aaatcgctga gtcacaactg aagaaccaaa acttatcaga aggagagtga 1740 

SeqID 102 

atgaaaaaag gacaagtaaa tgatactaag caatcttact ctctacgtaa atataaattt 60 

ggtttagcafc cagtaatttt agggccattc ataatggtca caagtcctgt ttttgcggat 120 

caaactacat cggttcaagt taataatcag acaggcacta gtgtggatgc taataattct 18 0 

tccaatgaga caagtgcgtc aagtgtgatt acttccaata atgatagtgt tcaagcgtct 240 

gataaagttg taaatagtca aaatacggca acaaaggaca ttactactcc tttagtagag 3 00 

acaaagccaa tggtggaaaa aacattacct gaacaaggga attatgttta tagcaaagaa 3 60 

accgaggtga aaaatacacc ttcaaaatca gccccagtag ctttctatgc aaagaaaggt 420 

gataaagttt tctatgacca agtatttaat aaagataatg tgaaatggat ttcatataag 48 0 

tctttttgtg gcgtacgtcg atacgcagct attgagtcac tagatccatc aggaggttca 540 

gagactaaag cacctactcc tgtaacaaat tcaggaagca ataatcaaga gaaaatagca 6 00 

acgcaaggaa attatacatt ttcacataaa gtagaagtaa aaaatgaagc taaggtagcg 6 60 

agtccaactc aatttacatt ggacaaagga gacagaattt tttacgacca aatactaact 720 

attgaaggaa atcagtggtt atcttataaa tcattcaatg gtgttcgtcg ttttgttttg 780 

ctaggtaaag catcttcagt agaaaaaact gaagataaag aaaaagtgtc tcctcaacca 840 

caagcccgta ttactaaaac tggtagactg actatttcta acgaaacaac tacaggtttt 900 

gatattttaa ttacgaatat taaagatgat aacggtatcg ctgctgttaa ggtaccggtt 960 

tggactgaac aaggagggca agatgatatt aaatggtata cagctgtaac tactggggat 1020 

ggcaactaca aagtagctgt atcatttgct gaccataaga atgagaaggg tctttataat 1080 

attcatttat actaccaaga agctagtggg acacttgtag gtgtaacagg aactaaagtg 114 0 

acagtagctg gaactaattc ttctcaagaa cctattgaaa atggtttagc aaagactggt 1200 

gtttataata fctatcggaag tactgaagta aaaaatgaag ctaaaatatc aagtcagacc 126 0 

caatttactt tagaaaaagg tgacaaaata aattatgatc aagtattgac agcagatggt 132 0 

taccagtgga tttcttacaa atcttatagt ggtgttcgtc gctatattcc tgtgaaaaag 13 8 0 

ctaactacaa gtagtgaaaa agcgaaagat gaggcgacta aaccgactag ttatcccaac 144 0 

ttacctaaaa caggtaccta tacatttact aaaactgtag atgtgaagag tcaacctaaa 150 0 

gtatcaagtc cagtggaatt taattttcaa aagggtgaaa aaatacatta tgatcaagtg 1560 

ttagtagfcag atggtcatca gtggatttca tacaagagtt attccggtat tcgtcgctat 162 0 

attgaaattt aa 1632 

SeqID 103 

atggaaaatt ggaagtttgc cctatcatcg attctagggc acaagatgcg tgcttttctg 60 

accatgttag ggatcatcat tggggttgca tcagttgtgt taattatggc cttaggtaag 12 0 

ggcatgaaag attcggttac aaatgagatt acaaagtctc agaagaacct tcagatttat 18 0 

tataaaacaa aagaagacca aaaaaatgag gataactttg gagctcaagg ggctttcatg 24 0 

caaggtagtg ataccaaccg taaggaacca attattcaag aatcatggtt aaaaaagatt 3 00 

gctaaggaag tagatggtgt tagtggctac tatgtaacca atcaaacaaa tgctccagta 360 

gcttaccttg aaaaaaaggc aaagacggtt aatataacag gagttaaccg cacttacctt 420 

ggtatcaaaa agttcaaaat taaaagtggt cgtcaattcc aagaggagga ttataaccaa 48 0 

ttctcgagag ttattttact agaagaaaaa ttagctcaga gattattcca gacaaatgaa 54 0 

gcagcgttaa ataaggtggt gactgtaaaa aataaatctt acttagtagt gggggtfctat 60 0 

tcggatccag aagctggttc aggcttatat ggaagtaatt ctgatgggaa tgctatttta 66 0 

accaatacac agttagcttc ggagtttggt gctaaagaag cagaaaacat atattttcat 720 
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cttaatgatg tatcgcaatc caatcgcatt ggtaaagaaa taggcaaacg tttaacagac 780 

attagtcatg ctaaggatgg gtactatgat aattttgaca tgacatctat agttaaatct 840 

atcaataccc aagttggtat tatgacaggt gttattggag cgattgcagc aatttctctc 9 00 

ctagttggtg gtatcggagt gatgaatatt atgttagfcct ctgtaactga gcgcacgaga 960 

gaaattggcc ttcgtaaagc ccttggggca actcgacgta aaattttagc tcaattttta 1020 

attgaatcaa tggtcttaac gattttgggt ggattaattg ggttattact agcctatgga 10 8 0 

ggtacgatgt taattgctaa tgcgcaagat aagattactc cttctgtgtc acttaacgtg 1140 

gctataggca gtttaatttt ctcagcattt attggaatta tatttggttt attacctgct 12 00 

aataaagcca gcaaattaaa cccaattgat gctttacgat atgaataa 124 8 

SeqID 104 

atgcagtatt cagaaattat gattcgttat ggagaactct ctactaagaa gaaaaaccgt 60 

atgegcttca tcaataagtt aaaaaataat atggagcatg tactctccat ttatccagat 12 0 

gtttcagtaa aaacagatcg tgatagagga catgtatatc tcaatggtac agattatcat 18 0 

gaagttgcag agtccttaaa agagattttt ggtatccaag ctttttctcc atcttttaaa 240 

gtagaaaaaa atgttgatac attggtaaaa gctgtccagg aaattatgac ttccgtttat 3 00 

aaagatggga tgacttttaa aattaccgca aaacgtagtg accactcatt tgaattggat 3 60 

agccgtgctc taaatcatac tttaggagat gccgtttttt cagtcttgcc aaatattaag 420 

gctcagatga agcaaccaga tatcaatctt aaagtcgaga tacgagatga ggctgcttat 48 0 

atttcatatg agaatattag gggtgcagga ggattaccgg taggaacatc tggaaaaggg 540 

atgctgatgt tgtctggtgg gattgattct ccggtggcag gttacctagc gttaaaacgt 600 

ggtgtagata tagaagcagt tcattttgca agtcctcctt atactagccc aggtgcattg 660 

aaaaaagcac atgatttaac acgtaaattg acaaaatttg gtggtaatat tcaatttatt 720 

gaagttccat tcacagaaat tcaagaggaa attaaggaaa aagctcctga agcctacttg 780 

atgacgttaa cacgtaggtt tatgatgcgt attacagatc gtattcgtga gaaccgaaat 84 0 

ggtcttgtta ttattaacgg tgaaagttta gggcaggtgg caagccaaac gctagaaagt 900 

atgcaagcca ttaatgctgt cactgcaaca ccgattattc gtcctgtggt cacgatggat 960 

aagctagaaa ttattgatat tgctcaaaaa atagatactt ttgatatttc aattcaacca 1020 

tttgaggatt gctgtacgat ttttgcacca gatcgcccaa aaactaaccc taaaattaag 108 0 

aatacagaac agtatgagaa acgtatggat gtagaaggtc tggtagagag ggcagttgca 114 0 

gggattatgg taactactat tcaacctcaa gcagatagtg atgatgttga tgacttgatt 120 0 

gacgatttat tataa 1215 

SeqID 105 

atggctcgtg ttaaaggtgg agttgtttca cgtaaacgtc gtaaacgcgt acttaaatta 6 0 

gctaaaggtt actatggtgc aaaacatatc ttgttccgta ctgcaaaaga acaagtaatg 12 0 

aattcttact actatgcata ccgtgaccgt cgtcagaaaa aacgtgactt ccgtaaatta 18 0 

tggatcactc gtatcaatgc ggctgctcgt atgaatggtt tatcatattc gcaattgatg 24 0 

cacggtttga aattagctga aatcgaagtt aaccgtaaaa tgcttgctga tttagcagtt 3 00 

aacgatgcag cagctttcac agctcttgca gatgcagcta aagctaaact tggtaaataa 360 

SeqID 106 

atgtttaaag cttcaaaaaa actcgttcaa aaaaataagt caaatcattt ttggcttgtt 6 0 

tttttcataa cacttatact atttcttatt ggttgttacg ccagcttacg atttggtgct 12 0 

atcaatttca agactagtga tttaataact gttttaaaaa atcccttaaa aaactctaat 18 0 

gctcaggatg ttatttttga cattagacta ccacgcataa ttgcagctat cttagtaggt 24 0 

gcagctatgt cacaagcagg tgctattatg cagggagtca ctcgaaacgc cattgctgat 3 00 

cctggactac taggtattaa cgccggagct gggttagccc ttgttgttgc atatgctttt 3 60 

ttagggagca tgcactatag tacgattctc attgtctgct tactagggtc tgtcatttct 42 0 

tatcttttag tcttcacact ctcttataca aagcaaaagg gttaccacca actacgactc 48 0 

attttagcfcg gagcgatgat ctccacactt tttacgtctg ttggacaagt ggttacatta 54 0 

tactttaaac tcaatcgaac tgttatcggt tggcaagcag gtggtttatc ccaaattaac 60 0 

tggaaaatgc taatcataat agcacctatc attattttag gtttacttat ttcacaactc 660 

ctagcacatc aattaactat cttgagcctt aatgaatctg tggccaaagc attgggacaa 720 

aaaactcaat taatgactgc gtttctatta cttattgttt tgtttttatc ggcttcatca 78 0 

gtcgcgttga taggaaccgt ttctttcatt ggattgatta ttccacattt tattaagctt 84 0 

tttattccca aagattatcg tctgttatta cctttaatag gattttcagg ggctactttc 900 

atgatctggg tagacttatc atctcgtatt ataaatcccc cttcagagac accaattagt 960 

tctataatca gtatcgtagg gcttccatgt ttcctatggt taattagaaa agggaaaaat 102 0 

ctatga 102 6 

SeqID 107 

atgattaaaa gaccaattca tctttctcat gattttttag cagaagttat tgataaagaa 60 

gcaataacgc ttgatgcaac tatgggaaat gggaatgata ctgttttttt agcgaaaagt 12 0 

tctaaaaaag tttatgcatt tgatattcaa gaagaagcaa tagcaaaaac gaaagcaaag 18 0 
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ctcacggagc aagggatttc taacgctgaa cttatcttgg atgggcatga aaatttggaa 240 

cagtatgttc atacaccttt aagagcggct atatttaatc tgggatactt accctcagcg 3 00 

gataaaacag ttattacaaa acctcacaca actatcaagg caafccaaaaa cgttttagac 3 60 

attttagaag taggtggacg tctatcactt atggtttatt atggtcatga tggtggaaaa 420 

agtgagaaag atgcggtgat agcattcgtg gaacaattac cacagaataa tttcgctaca 480 

atgctttatc aaccactcaa tcaagttaat acgcccccct ttttaattat ggtagaaaag 54 0 

ttacaatcat atgaaaatga ggtgtaa 567 

SeqID 108 

atgcgagtac atattacaag tatatatggg cagtcaccgc gtagcattgc ccttatatca 6 0 

caaaaactag ttaaggatgt cggacgtcaa cttggttatg atgaaatggg aatctacttt 12 0 

tataatgatc atgctgaaac gcatggtgaa agaagtacta ggatggatgg tatcattgcg 18 0 

ggattaggca gaggggatat tgttgttttt caagtaccta catggaattc aacagaattt 24 0 

gatgaacttt ttctagataa attacaggca tacggtgcta gaattataac ttttgtacat 3 00 

gatattgttc ccttaatgtt tgagtctaat ttttatttat tagatagagt aattgacatg 360 

tataatcgtt cggatgttgt tattctacct acaaaagcga tgcatgatta tctaattgaa 42 0 

aaagggatga ctacctctaa agttctttat caagaagtat gggatcatcc agttaatata 48 0 

gacctccctc gtccagaatg tcaaaaagtc cttagttttg caggcgatat tcaacgcttt 54 0 

ccctttgtca atgactggaa ggaaaatatc ccgctcatct attatggaga cggaagcaga 600 

ctcaattctg aggcaaatgt tcatgctcag ggatggaaag atgatgtaga attgatgcta 66 0 

agtctttcta agcggggagg atttggcctg tgttggagtg aagatagaga ggagttagtc 72 0 

gagcgtcgtt attcgaggat gaatgcctcg tataaattat ctactttctt ggctgctgga 78 0 

cttccaatta ttgctaacca tgatatttca agtagagact ttatcaaaca acacggatta 84 0 

ggatttacag ttgaaacgct agaagaagct gttgaaaaaa ttaataatat ggaaaaagag 900 

acttatgaca gctatgtgga aaafcgttgag aagatagcaa ctctcttgcg aaatggttat 96 0 

atcactaaaa aattgttgat agatgctgtg catatgcttt atagatag 1008 

SeqID 10 9 

atgtcccaaa agacttttgg caagcagtta acagttgtag atactaagag tagagtcaag 6 0 

atgcataaat caggaaaaaa ctgggtaaga acagtaatgt cgcattttaa tctatttaaa 120 

gcgattaaag ggagagcaac tgttgaagca gatgtgtgta ttcaagatgt tgaaaaagaa 18 0 

gaccgactat cttcaggaaa tttgacctat ctcaaaggaa tactagctgc cggagctctg 240 

gtaggtggag cgagtttaac cagtcgtgtt tatgcagatg agactccagt tgttcaagaa 3 00 

caatcaagtt ctgtaccaac actggcagaa caaacggaag tgactgttaa aacaactact 3 60 

gttcaaaatc atcaagatgg gacagtatcg aaaaacatta ttgattctaa tagtgtatct 42 0 

atgtcagagt cagcctcaac aagtactagt gaatctgtaa gtatgtctat gtcagggtca 48 0 

actttaacaa gtgtaagtga atctgtaagt acatctgctt taacaagtgc ttcagaatcg 54 0 

ataagcacgt cagcctcaga aagtgtttca aaatctacaa gtattagtga ggtttcaaat 600 

attcttgaaa ctcaagcttc tttaactgat aaaggaagag agtcgttttc ggcaaaccag 660 

atagtaacag aaagtagctt agttactgat gctggtaaaa atgcttcagt atctagccta 720 

attgaaatta caaaaccaaa atcggagtta cagacttcca aaatgtcaaa tgagtcgctt 78 0 

ataactccag agaaatccca agtaatgatt gcaagcgata aaactgggaa tgagagtcta 840 

actccgacaa ttagattaaa atcagttatt cagccaagga gtatgaactt gatgactttg 9 00 

agttcggaga tggacttgat accactagaa gaagtgtctg atactgaaat gttaggtaaa 960 

gatgtatcaa gcgagttgca gaaagttaat attgcgttaa aagataacac tcttagtgag 1020 

cctggaacag ttaaattaga tagttcagaa aaccttgttt tgaactttgc cttttcaatc 108 0 

gcttctgtta acgagggaga tgtctttact gtaaagcttt ctgataacct tgacacacaa 1140 

gggattggta ctattctaaa agttcaagat ataatggatg aaacggggca gttattagcg 1200 

actgggtcat atagtccttt aacacataat attacataca cctggacaag gtatgcttct 1260 

acgttgaata atattaaagc tagagtcaat atgccagttt ggcctgacca gagaataatt 1320 

tctaaaacaa cttcagataa gcagtgcttt actgcaacat tgaacaatca agttgcttca 13 80 

attgaggaac gtgttcagta taatagtcct tcagtgacag aacatactaa tgttaagaca 144 0 

aatgtaagat ctcggatcat gaagcttgat gatgaaagac agacagaaac t tat at tact 1500 

caaattaatc ctgaaggtaa ggaaatgtat ttcgcatcag gacttgggaa tctatatact 1560 

attatcggtt cagatggaac atcaggttca ccagttaatt tattaaatgc ggaagtaaag 1620 

attctaaaaa ctaattcaaa aaatcttaca gatagtatgg atcaaaatta tgattcgcct 1680 

gagtttgaag atgtgacttc ccagtatagt tatactaacg atggttctaa aattaccata 1740 

gattggaaaa caaattctat ttcttccact acatcttatg ttgttttggt caaaatacct 18 00 

aaacaaagtg gtgtattgta ttcaactgtt tctgatataa atcaaacata tggttctaaa 18 60 

tattcttatg ggcatacgaa tataagtggt gactcagatg cgaatgccga aattaaactt 1920 

ttatcagaaa gtgcttctac gagtgcgtcg acgtcagcaa gtaccagcgc ttccatgagt 19 80 

gcctcgacat cagcaagtac cagcgcttcc atgagtgcct cgacatcagc aagtaccagc 2 040 

gcttccatga gtgcgtcgac gtcagccagc accagcgctt caaccagcac ctcaacgagt 2100 

gcctcgacat cagccagcac aagtgcttca acaagtgcaa gtatgagtgc ttcaacaagt 2160 

gcaagtacga gtgcatccac gtcagcaagt actagcgctt ccacaagtgc cagcaccagc 2220 
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gcctcaacga gtgcctcaac gtcagccagc actagcgcct caacgtcagc cagcaccagc 22 80 

gcctcaacca gtgcttcaac cagtgcttcc acatcagcca gcactagcgc ctcaatgagt 2340 

gcgtcgatgt cagccagcac aagtgcttca atgagtgcct cgacgtcagc cagcactagc 24 00 

gcctcaacca gtgcttcaac cagtgcttca accagtgctt ccacctcagc aagtatgagc 24 60 

gcctcaacaa gtgcaagtac tagcgcttca acaagtgcaa gtatgagcgc ctcaaccagt 2520 

gcaagtacta gtgcttcaac aagtgccagt accagtgcct cgacgtcaac aagtactagc 2580 

gcttccacgt cagcaagtac cagtgcatcc acatcagcaa gtatgagcgc atccacgtca 2 640 

gccagcacca gcccctcaac cagtgcttca accagtgctt caaccagtgc ttcaaccagt 2700 

gcttccacat cagccagcac tagcgcctca atgagtgcat ccacgtcagc aagtactagc 2 760 

gcctcaatgt cagccagtac tagcgcctca accagtgcca gtaccagtgc ttccacctca 2 820 

gcaagtatga gcgcctcaac aagtgcaagt actagcgctt ccacaagtgc aagtatgagc 2 880 

gcctcaacca gtgcaagtac tagtgcttca accagtgcca gtaccagtgc ctcgacgtca 2940 

acaagtacta gcgcttccac gtcagcaagt accagtgcat ccacatcagc aagtatgagc 3 0 00 

gcatccacgt cagcaagtac cagtgcatcc acgtcagcaa gtatgagcgc ctccacaagt 3 060 

gcaagtatca gtgcatccac gtcagcaagt atgagcgctt ccacaagtgc aagtaccagt 3120 

gcatcgacgt cagccagtac tagcgcctca atgagtgcat cgacgtcagc cagcaccagt 3180 

gcttccacaa gtgcaagtac tagcgcctca atgagtgcat cgacatcagc aagtaccagc 3240 

gcttccacaa gtgcaagtac gagtgcttcc acatcagcaa gtactagcgc ctcaacatcg 3 3 00 

gcaagtacaa gttcttccac aagtgcaagt accagtgcat cgacatcagc cagcactagc 3 3 60 

gcctcaatga gtgcctcgac gtcagccagc acaagtgctt ccatgagtgc gtcgacgtca 3 420 

gccagcacta gcgcctcaac gagtgcgtca atgtcagcca gcacaagttc ttcaacaagt 3480 

gcctcgatgt cagccagcac tagcgcttca atgagtgcct cgacgtcagc tagcactagc 3 540 

gcctcaacga gtgcgtcaat gtcagccagc acaagttctt caacaagtgc ctcgatgtca 3 600 

gccagcacta gcgcttcaat gagtgcgtcg acgtcagcca gcaccagcgc ttcaacgagt 3 660 

gcgtcaatgt cagccagcac tagcgcttca atgagtgcca cgacgtcagc cagcaccagc 3 720 

gtctcaacga gtgcatcgac atcagcaagt accagcgctt ccacaagttc ttcaagctca 3 78 0 

gtgacttcta attcatcaaa agagaaggtg tattctgcct taccttctac gggtgaccaa 3 840 

gattattctg taactgctac tgccttaggt ttaggtttaa tgactggtgc aacccttttg 3 9 00 

ggacgaaaaa aatctaaaaa agataaagac taa 3 933 

SeqID 110 

atgatagata gaaaagatac taaccgattt aaattagttt ccaaatatag cccttcggga 60 

gatcaacctc aagctattga aacgttggtt gacaatattg aaggaggaga aaaagctcag 120 

attttaaaag gtgctactgg tactggtaaa acctacacaa tgagccaagt tatcgcccaa 180 

gttaacaagc caaccttggt gattgcacat aataaaacct tagcaggtca actttatgga 240 

gagtttaaag aattttttcc tgacaatgcc gtagagtact ttgtttcata ttatgattat 3 00 

tatcaaccag aagcttatgt gccttcttcg gatacttata ttgaaaaaga tagttcagtc 3 60 

aatgacgaaa ttgacaaatt acgtcactca gcgacttctt cgttattaga aagaaatgac 420 

gttattgttg tagcatctgt ttcttgtatt tacggtttag gttctcctaa agagtatgca 480 

gacagtgttg ttagccttag gccaggtcaa gaaatttcac gtgatcagtt attgaacaat 540 

ctcgttgata ttcaatttga gcgtaacgat attgattttc aacgtggaaa atttcgcgtt 6 00 

cgtggcgatg ttgtagaagt ttttcctgcg agtcgtgatg agcatgcctt tcgaattgaa 660 

ttttttggtg atgaaattga tcgtataaga gaaatagaat cgttaacagg gcgtgtgcta 72 0 

ggagaagtgg aacatttagc tattttccca gcaacacact ttatgacaaa tgatgaacat 780 

atggaagaag ctatttcaaa aatacaagct gaaatggaaa atcaagtaga actttttgaa 840 

aaagaaggga aattgattga agcgcaacgt attcgacaaa ggacagaata tgatattgaa 9 00 

atgcttcgtg aaatgggata tacaaatggt gttgagaact actcacgtca catggatggt 960 

agaagtgaag gtgagccacc ttttactttg ttagatttct tcccagaaga ttttttgatt 102 0 

atgattgatg agagtcacat gacaatgggg caaatcaaag ggatgtataa tggagaccgt 108 0 

tcgcgtaaag aaatgttggt taattacggt ttccgtttac catctgcctt agataaccgt 114 0 

ccacttcgtc gtgaagaatt cgaaagccat gtccatcaaa ttgtctatgt atctgcaacg 12 00 

ccaggagact atgaaatgga gcaaacagat acagttgttg agcaaattat ccggccaaca 126 0 

ggacttcttg atccagaagt tgaagttaga ccaagtatgg ggcagatgga tgacttgctt 13 2 0 

ggtgaaatca atttgaggac tgaaaaaggt gagcgtacct ttattacaac attgacaaag 13 8 0 

cgtatggcag aagacttaac agattatctc aaagaaatgg gagttaaagt taagtatatg 144 0 

cacagcgata ttaaaacctt agaacgtaca gagattattc gtgatttacg acttggtgtg 15 0 0 

tttgatgttt taattggtat taatctctta cgtgaaggaa tagatgttcc tgaggtttcc 15 60 

cttgttgcaa tcttagatgc tgataaggaa ggtttccttc gtaatgagcg tggtcttatc 162 0 

caaacaatcg gacgtgcagc acgtaacagt aatggtcatg ttattatgta tgctgataaa 168 0 

ataacagatt ctatgcaaag agctatggat gaaaccgctc gccgtcgtcg tttacaaatg 174 0 

gattataatg aaaagcatgg aattgtgcca caaacaatta aaaaagaaat ccgtgattta 18 0 0 

attgctatta ctaaaagtaa tgacagcgat aagcctgaaa aagttgtcga ttacagtagt 1860 

ttatctaaaa aagaaaggca agctgaaatt aaagctcttc aacagcaaat gcaagaagca 192 0 

gctgaattat tggactttga gctagctgca caaatccgag atgttatctt agaactaaaa 19 8 0 

gcaatcgatt aa 1992 
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SeqID 111 
atgaagcaca 
ggaaaggtca 
tttaaagatt 
aagcgtgctg 
gttcaatctg 
aaagtcttta 
aataataaag 
acagctgctc 
tttgatacaa 
gacgatcaac 
gaaggcgagg 
ctaattaaag 
atcatggata 
gccaatgaaa 
gcaccattcg 
aagaaaattg 
gctttaaatg 
gaagcacgcc 
gcggttaaaa 
gctaaaaatg 
catgttaaag 
gatgctctaa 
gaaacaccta 
aatccagaat 
tacgataaac 
gctaaacctg 
ctatctggta 
attggtatta 
atgatttttg 
tggggtattc 
gtfcgctgcta 
ggtggtattg 
tacggcaaaa 
tttatcaacc 
gtggaactct 
tatgctatcc 
cgtttagaaa 



agttaaaagc 
gtgcagaaac 
ctgatcaaac 
gctggaatgt 
gacaggcaga 
afcttctcaga 
tcacaaacta 
aaagcttctt 
gtgacctaat 
cagttgtgca 
cagttggtag 
aatttaacac 
aatggcttgg 
aagcaactcc 
aatatcaaaa 
ctaaacagca 
ctgtccaatc 
aaaaaatctt 
aaggaagcaa 
gtactgcctc 
catttgatga 
tggatgacga 
tcaaaggtga 
taattaaaat 
ttgttaaaaa 
tagatgaatc 
ttggaactac 
tctttggtat 
ttgatattgt 
ctaatttaat 
ctatcgctct 
aagctgttcc 
ctatgcaaaa 
aatttgtcat 
tccaaactgg 
ttgcaattat 
agaggcttaa 



ttttacgctt 
tgtgaatatt 
ttataaagga 
taacatgacg 
cgcgctaatg 
tacttattac 
caaacaacta 
agaggaaaat 
gaataacagc 
atttgcgata 
ctttgcattt 
agcttttgca 
gaaagacgct 
tgtaaagcca 
cggtaaaggg 
aggttttaaa 
tggacaagct 
tgatttttct 
tgtcaaatca 
atatacttgg 
agcatctaca 
agcagttctt 
aaaatcaggc 
gtttaacaac 
atacctttcc 
aactatttta 
tttaagttta 
gatgagcgta 
ccgtggtatt 
cgaaagcatc 
ttctttaaat 
ttctggtcaa 
ggttatcttg 
ctcattaaag 
taaaatcata 
gtatcttgta 
ataa 



gctttactct 
gtttctgata 
atcgatgttg 
tacccaggtt 
gccggaacta 
gatacttccg 
aaaggaaaag 
aaatctaaat 
cttgattctg 
aatcaaggaa 
gctgtcaaaa 
caaatgaaat 
acaaaaacaa 
agttabaaaa 
aaatatactg 
cttgatatct 
gacggtgtta 
gatccttatt 
taccaagatt 
ttatcagacc 
atgtatgata 
gcttacgcta 
gatatcggat 
ggtcttgctt 
acagccagca 
gggttaattt 
actcttatct 
tcaccaagta 
ccactcatga 
acaggtcacc 

ggtggtgcgt 
atggaagcaa 
cctcaagcag 
gatacaacaa 
attgcacgta 
atcattaccc 



caatattctt 

cagcatacgc 

acatcgttaa 

ttgatgccgc 

ctgttactga 

ttattcttta 

tagtcggtgt 

acggctataa 

gttctattta 

aagcttacgc 

aaggtagtgg 

cagatggcac 

gcggcaaagc 

ttgtttctga 

gttttgatat 

caaatccagg 

ttgcaggagc 

acacatctag 

taaaaggaaa 

acgcagataa 

gtatgaactc 

ttaatcaagg 

ttgcagtgaa 

cactcaaaaa 

cctcttcaaa 

ctaataacta 

catttgcgat 

atactctccg 

ttgtggccgc 

aaagtccaat 

acattgctga 

gtcgtagttt 

tacgccttat 

ttgtatcagc 

actatcaatc 

ttctcacccg 



tgtgtttggt 
tccattcgaa 
cgaagtcgct 
agttaacgct 
agcacgtaaa 
tactaaaaat 
aaaaaatgga 
agttaaaaca 
cgccgctatg 
cattaacatg 
acacgataac 
ttataatgac 
aacaggtaat 
ttcttcattc 
ggaattaatc 
ttttgatgcc 
tacaatcaca 
cgttatctta 
aacagttggt 
gtacaactat 
aggttcaatt 
tcgtaaattt 
aaaaggggca 
atcgggtgag 
cgataaagct 
caaacaattg 
tgctatggtt 
cacaatttca 
ttttattttc 
taatgacttc 
aattgtacgt 
aggtatttct 
gttaccaaac 
aatcggactt 
attccgtatg 
tctagcaaaa 



SeqID 112 

atgggagata aaccgatatc atttcgtgat aaagatggga attttgtttc agcggccgat 
gtctggaatg ctgaaaaatt agaagaactc tttaacacat taaatcctaa tcgtaaactt 
cgtttagaac gcgaaaaatt agcaaaagaa aaaggatag 



SeqID 113 

atggaaacat 

caactaactg 

actcatattg 

gacagtggtt 

gactatcatt 

attgatcttc 

aaagctggga 

aaagctattg 

gaaataaaac 

gttcgcttct 

gctatttttg 

caagtagctc 

cccaaaaata 

actaccttaa 

taa 



atacfcctatc 
aaggtgaaga 
atactgctca 
tggcgcgaga 
tagccaaagc 
ttttgattca 
atgcaggcac 
gtgtttcgaa 
ctatggttaa 
gtaaaggaaa 
aaaatgaatc 
ttcgctggtc 
ttgaagcaaa 
tacaactaga 



gaatactctt 
agcttataag 
gatttatggc 
aagtatcttt 
ttctattgat 
ttggccaaat 
ttggaaagcg 
tttcatgaaa 
tcaaattatc 
tgatatctta 
tataaaagct 
acttgataat 
ccttgatatt 
ttcaggtatt 



aacattccta 
gcagtgactc 
aacgaacaca 
ttgacaacta 
gaatctctgc 
ccaaaagctc 
atggaagaag 
catcacttag 
ctagcaccag 
ttagaagctt 
attgctgaaa 
ggctttttac 
tttgattttc 
aaacccaaag 



aaattggttt 
atgctttaaa 
gtgttggacg 
agatttggaa 
agaaatfcagg 
ttcgtgaaaa 
cttataaaga 
aggctttatt 
gttgtgctca 
acagtccatt 
aatatggtaa 
cattacctaa 
aattaaacga 
atccagataa 



tgggacatgg 
agttggttat 
tgctataaga 
tgacaaacat 
tgttgattat 
tgatgcgtgg 
aggaaaggta 
tgaaactgcc 
agaggatttg 
tggaacaggt 
atcagttgca 
gtcagcgact 
agatgatatt 
tgtctcattt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2184 



60 
120 
159 



60 
12 0 
180 
240 
300 
360 
420 
480 
54 0 
600 
660 
720 
780 
840 
843 



SeqID 114 

atggttaaag gtcaaggaaa cgttgtagca caaaataaaa aagcgcatca cgactatact 
atcgttgaaa caatagaagc aggaattgtt ttgacaggaa cagaaatcaa aagtgtccgc 



60 
120 
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gcagcgcgta ttacattgaa agatggatat gcacaaataa aaaatggtga agcgtggttg 180 

attaatgttc atattacacc atacgatcaa ggtaatatat ggaatcaaga tcctgatcgt 240 

acacgaaagt tattactaaa aaaacgtgag attgaaaaaa tatcaaatga actgaaaggg 3 00 

actggtatga ctctggtacc tctaaaagtt tacttaaaag atggttttgc taaagttcta 3 60 

ttgggtttgg caaaaggaaa acatgattat gataagcgag agtctatcaa acgtcgtgag 420 

caaaaccgtg atattgcccg tcagttaaaa aactataatt cacgatga 468 

SeqID 115 

atggaccatt tcacaaaact ctggcaagat ttctctaaac ttccaaatgt tgttgctata 60 

gcactcggag gctcacgctc tggtgacagt tttgaccaat cttccgatta tgatttgtat 120 

gtttattgtg cagcaactcc tgatatcact agtcgtaaac gtatccttaa caagcactgt 18 0 

cattacattg aacttaacaa tcattactgg gagcttgaag ataatggtac tttaaacgac 240 

ggaactgata ttgatattct ctatcgtaac atagataact ttttatcaga cttagaagat 3 00 

gtcgttgaac accacaattc tcgaattgga tacactactt gtttttggca taacctcatc 3 60 

aattgccaaa tactctatga tcctgaaaat caattacaat cactcaaaga gagattcgaa 420 

gtttcttatc ccagtcagtt acaaaaacaa attatcattc aaaatcgtaa cttattaact 48 0 

ggcaagcttc cctcttacga taaacaaatt ataaaagccc ttaaacgcca agactttgtt 540 

agtactcacc atagaactac tgctttctta gattcctact ttgatattat ttttgcactt 600 

aataagttga cacatcctgg cgaaaaaaga atgatttcct atgctaagaa gaatgctaca 660 

ttgcttccta aacatttcga agaaaatatc attaaactat gtcatcacaa ctccaacgaa 72 0 

cacactgtta aagaaacatt aaacgatata ataatgcatc tcgacgtcat gcttaaagaa 7 80 

aattttcaac actttatagg ttaa 8 04 

SeqID 116 

atgaaaaaaa ttatttatct tggtttagcc tgtgtcagta tattaacttt aagtggttgc 60 

gagtcaattg agcgttctct aaaaggagac cgtfcatgttg accaaaaact agctgaaaac 120 

tcttcaaagg aagctactga acaactaaac aaaaaaacta agcaagcctt aaaagctgat 18 0 

aaaaaagctt ttcctcaatt ggacaaggct gttgccaaaa acgaggcaca agtgcttata 24 0 

aaaacctcta aaggtgatat aaatatcaag ttatttccaa aatatgctcc tttagctgtt 3 00 

gaaaattttc taacacatgc aaaagaagga tactataatg ggttaagttt ccacagagtt 3 60 

atcaaagatt ttatgataca atctggtgac cctaatggag acgggaccgg tggaaaatct 42 0 

atatggaata gcaaagataa aaagaaagat tctggaaatg gttttgttaa tgaaatatcc 48 0 

ccttacctct acaatatcag gggaagttta gcaatggcaa atgctggagc tgatactaat 540 

ggcagccaat tctttatcaa tcaaagccag caagatcatt ccaaacaact gtcagataaa 600 

aaagttccta aggtgattat taaagcctat tcagagggag gaaatccaag tttagatggt 660 

ggttataccg tctttggaca agtaatctct ggcatggaaa ctgtggataa aattgcttcg 72 0 

gtagaagtta caaaatcaga tcaaccaaaa gaaaaaatta ctattacaag tatcaaagtt 78 0 

attaaagact acaaatttaa ataa 804 

SeqID 117 

atgaaaattg gaattattgc agccatggag gaagagttaa aactccttgt ggaaaatctt 6 0 

gaggataaat ctcaagagac agtattatca aatgtatatt acagtggacg ttatggtgaa 120 

catgagcttg ttttagtaca gtctggtgtt gggaaagtca tgtcagcgat gagtgttgct 18 0 

attttagtag aaagctttaa agttgatgct attattaata caggatcagc tggtgcagta 24 0 

gcaacaggat tgaatgtagg agacgtcgtt gtcgcagata cattagttta tcacgatgta 3 00 

gatttaaccg cctttggtta tgactatgga caaatgtcaa tgcagccact ttattttcat 3 60 

tcggataaga cttttgtttc cacttttgaa gcagttttat ctaaagaaga aatgactagt 420 

aaagttggtt taattgcaac tggggatagt tttattgctg gccaggaaaa gattgatgtt 480 

attaaggggc attttccaca ggttttagct gtcgagatgg aaggtgcagc aattgcccaa 540 

gctgctcaag ctacaggaaa gccctttgtg gtggtacgtg ctatgagtga tactgcagct 600 

catgatgcca atatcacatt tgatgaattt atcattgaag ctggtaaacg ctctgcccaa 66 0 

gtattaatgg catfctttgaa agcattataa 69 0 

SeqID 118 

atggaaaaga gacttagttt aggagctctt gttttagcta gtacagttct tttagcagca 60 

tgtggaaatg tcggaggagg agcatcttcg acaggaacaa aaatagggaa agatattaag 12 0 

gtagggtata actgggaatt atcaggaaac gtttcttcgt atggaaattc aatgaagaat 180 

ggagctgact tagcagtaaa agaaattaat gctgctggag gggttggcgg caagaagtta 24 0 

aaagtcctat cacaggataa taaatcagaa aatgcagagg cagcaacggt tgccacaaac 3 00 

ttggttacca aaggagctaa tgtcattatc ggaccagcaa catcgggtgc agctgcatct 360 

tcaactccaa aagtaaatgc agcagcagtt ccaatgattg cacctgctgc gacacaagac 42 0 

aatttagtct atggttctga tggaaaaacc ttaaatcagt atttcttccg agctactttt 48 0 

gtcgataatt atcaaggaaa gctattgtct cagtatgcta cagacaacct taaagctaaa 54 0 

aaagttgttc tattttatga taattcatca gattactcaa agggggtagc aaaatcattt 60 0 

aaggaaagtt atagtggaaa aattgttgat agcatgacat tctccgctgg tgatactgat 660 
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ttccaagcgt cattgactaa gttgaaaggg aaagaatatg atgctattgt gatgccaggt 720 

tactataccg agacaggatt aatagttaag caagcgcgtg atttaggtat ctctaaaccg 78 0 

gttcttgggc ctgatggttt tgatagtccg aaatttgtgc aatcggcaac acctgtggga 84 0 

gcttcaaacg tttattattt gacaggtttc actacacaag gatcaaccaa agctaaagct 900 

ttccatgatc attacgttaa ggcatatggt gaagaaccat ccatgttctc ggctttatca 96 0 

tacgatgccg tgtatatggc agctaaatct gctaaaggag ctaaaacttc aattgatctt 102 0 

aaaaaagctt tagctaagct gaaagatttt aaaggtgtga cagggaaaat gtctattgat 108 0 

aaaaatcata atgttgttaa atcagcttac gttgtcaaat tagacgatgg aaaaacaagt 114 0 

agcgttaaca ttatttcagc aaaataa 1167 

SeqID 119 

atggagaatc ataattctat taaacagacc tatggattaa tgactacaat tgctatgatt 60 

gtaggtgtag tgattggttc aggcatttat tttaaagttg atgacatcct taaatttact 12 0 

ggtggagatg tctttttagg catggtaatt ttagtactag gttctttttc gattgttttt 18 0 

ggaagtttgt caatttctga gctagctatt agaacgagtg aaagtggtgg tattttttct 24 0 

tactacgaaa aatatgttag cccagcttta gcagcaactt taggcttgtt tgcgtctttt 3 00 

ttgtatcttc caactttaac agcaatcgtt tcatgggtag ctgcatttta tactttaggt 3 60 

gaatcatcaa gtttggaatc acagattatt ttagctgctg tttatatttt agcacttagt 42 0 

ctcatgaata tctttgctaa gcgtattgca gggggattcc aatctttaac gacctttgta 48 0 

aaaatgattc ctttagtcct aattgcttta attggtgctt tttggtctga taaagctcca 540 

caacttccac aacatttaac agctattcag ccgtcaaatg ttggatggag ttgggtgtca 600 

ggtctggtac cattgtattt tgcatatgat ggttggacaa tttttgttag tattgcacca 6 60 

gaagtcaaga atcccaagaa gaatttacca cttgcctttg tcattggtcc agcgcttatc 72 0 

ttattatcat atttggcatt tttctatggt ttgacacaaa ttttaggtgc tagctttatt 78 0 

atgacaaccg gtaatgatgc tattaattat gcagctaaca ttattttcgg tcctagtgtt 84 0 

gggcgtttat tatctttcat tgttatttta tctgtattag gtgttgccaa tggactttta 900 

cttgggacga tgcgtctacc tcaagctttt gcggaacgcg gatggatcaa aagtgagagg 9 60 

atggctaaca taaatttgaa gtatcaaatg tcattgccag ctagtttgac ggtgacggct 102 0 

gtagctattt tttggctttt tgttcacttt atggttacaa aatttaattt attgccgggt 108 0 

agtgatatta gtgaaattgc tgtggtgttt aataatacta gtttaatcat tctttatgtt 114 0 

ttagtactca gcctttatct caaaaaagat attaaaaata aatttaccgg cttagtttca 12 0 0 

ccgattctag caattttagg cggtttgatc ctctttatcg ggagtttgtt aagtaatttc 12 60 

tttacggttt taattttcca atgtttttgc ttattattct gtttgatttg tcattatatc 1320 

tatcaaaaaa ataacccaaa aactcatgaa tag 13 53 

SeqID 12 0 

ttgaccgaat ttaatgacga tcagcattct aaccatgatc agaaaagttt taaagaacaa 60 

attctggcag agfctagaaga agctaaccgt ttgagaaagt tgcgtgaaga agaactctat 12 0 

caaaaagagc aagaagccaa agaagctgct cgcagaacag cacaattgat ggcagattat 18 0 

gaagcgcaac gtfctgaaaga cgaacaagaa gctagagcaa aggcgctaga gactaagcaa 24 0 

cgcttagaag agcaagagaa ggctcggatt gaggcgaaac ttctggcaga agctgctcga 3 00 

gaagaagaaa gacgtcaggc ggaacaagca ttagcttccc aagaagaaca agttatcaat 3 60 

caagggatgg aaccttcaag agagcttgat agcggctcca aatctagtga atttaggaca 42 0 

actgaaaatg tacctgacat agatttaaaa gcagataaga ctgatgttgc aactgctgtg 48 0 

cctaatcaag aaacggaaga aatctttctt gtgagagcaa cagatattcc tacagaaggt 54 0 

gaaaatgtaa aacttggtga gacatcagag ttagagcccg tagcaaagga acctatccga 60 0 

gtagaggatt tgfccaaaaga agaagaggat atagctttgt cagccaagaa caaacataac 660 

aagcgtgaaa gacgtcagaa agcagataat gtggcaaaac gaattgcacg tattctgata 720 

tcaattatca tacttgfcatt gctcttgaca gcatttgtag gttatcgttt tgtagatagt 780 

gctattaaac ctgttgattc caattcaaat aaatttgtac aagttgaaat tccgatagga 84 0 

tctgggaata aattaattgg tcaaatttta gaaaaagcag gagtcattaa gagcgccaca 900 

gtctttaatt actattccaa gttcaaaaac tatagtaatt ttcaaagtgg atactacaat 96 0 

cttaaaaaaa gtatgacttt agatcagatt gctgctgaat tggaaaaagg cggaactgca 102 0 

gaaccgacaa aaccagcttt aggtaagata ttaattaccg agggttacac tattaaacag 10 8 0 

attgctaaag ctattgagtc taataaaatt gatacaaaga ctacaagtac accttataaa 114 0 

gctgatgatt tcttgaaact tgttcaagat gaaacgttta ttaaaaaaat ggtagctaaa 12 0 0 

tatccaaatt tactaggaag tctacctgat aaatcaaaag ctatttatca attggaagga 12 6 0 

tatttattcc cagctactta caactattat aaagatacaa ccttagaagg acttgttgaa 132 0 

gatatgattt caacaatgaa tactaagatg gcaccttact ataacactat taaagcgaag 13 8 0 

aatatgtctg taaatgatgt tttgacatta tcatctcttg ttgaaaaaga gggatcaaca 144 0 

gacgaagatc gccgtaaaat agcaagtgtc ttttataacc gattatcagc cggtcaagct 150 0 

ctfccaaagta acattgcaat tttgtatgct atggggaaat taggtgataa aacaagttta 156 0 

gctgaagatg ctcaaattaa tacttcaatt aaatcaccat acaacatcta tactaatact 162 0 

ggtttgatgc ctggaccagt tgatagtcca agtatctcag caattgaagc gactataaaa 168 0 

cctgcttcaa cagattatct ttattttgtc gctgatgtga agaccggaaa tgtatactac 1740 
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gctaaagact ttgaaacaca caaggctaac gttgaaaaat atatcaatag tcaaattaac 18 0 0 

taa ~ J 18 03 

SeqID 121 

atgagtagaa tgattccagg acgtattcga aatcaaggta tagaactcta cgaacaaggt 6 0 

ttggtgtcct tgatttctca agaaggtaat ctattaaaag ctaaagttgg ggattgtcag 12 0 

attgaatatt cattagtaac agaagaaaca aaatgttctt gtgatttttt tgcaagaaaa 18 0 

ggctattgtc aacatttagc tgctctggag catttcttga aaaacgaccc agagggtaaa 240 

gcaatcttat ctaaagttca agttcaacaa gaaagtcaac aagaaactaa gaagaaaacc 3 00 

tcttttggta gtgttttttt agatagtcta attattaatg aagatgatac tataaaatat 3 60 

caactatcgg ctcaaggaga acaaaatcct tatgctaatg atatctggtg gactttaaaa 420 

attagacgac tgcctgatga tagaagttat gttattcgtg acatcaaagc ttttttaaat 480 

actgttcgaa aagaagctta ttatcagatt ggaaaacaat actttgaaac tttgtctttg 540 

atacagtttg atgagaccag tcaagaattg atagaatttt tatggaggct cattccaagt 60 0 

cattcaagca aaattgatct agagtttatt ttaccaaatc aaggtcgtca ccttagtctc 660 

actcgtggtt tttttgaaga gggcgttact ttgatgaatg ctttagaaaa tttttcattt 72 0 

gaaagtgatt ttcatcaatt taatcacctc tactttaagg aattagaagg tgaggatcat 780 

ctttatcaat ttaaggttat tgtccatagg caatcaattg aattagaaat caaagagaaa 840 

gacttaaaac cactatttgc gaattcttat ttgttttatc gcgatacttt ctaccatctt 9 00 

aatttgaagc aagaaaagat ggtaacggct attcgtagtc ttcccattga gggagattta 9 60 

gcaaaacata tccattttga tttagatgat caagataagt tagcagctca tttactggat 1020 

tttaaggaga ttggactagt agatgctcca cggagcttct ctattcatga ttttaaggta 10 80 

aattttgagt tcgatattaa tagtcaaaat gaaattttat tgcaaatggt atttgattat 1140 

ggtaatgatt taactgttca taatcgtcaa gaactagaac agttaacatt tgctagtcac 12 00 

ttcaaacatg aagaaaaaat atttaaactc ttagaaagat atggctttgc accccacttt 12 60 

tcaaccagcc atccagcata cagtgcacaa gaattatacg atttttatac ctatatgctt 13 20 

ccacaattta aaaaaatggg aacagtgagc ttatccgcaa aattggaaag ttatcgtctc 13 8 0 

atagaacgac cgcaaattga tattgaagct aagggaagcc tgctagatat ttcttttgat 144 0 

ttctctgatt tactggaaaa cgatgttgat caagctctgg tagctttgtt tgacaataat 15 00 

ccttattttg tgaataaatc agggcaatta gttatttttg atgaagagac taagaaagta 1560 

agtgcaactt tgcaagggtt gagagctaga cgagcaaaga atgggcatat tgaattagat 162 0 

aatattgcag catttcaatt atcagaactg tttgcaaatc aagataatgt tagcttttca 1680 

caacattttt accaattgat tgaagatttg agacacccag aaaagtttaa aatacctggt 1740 

ctatctgttt cagcttcctt gcgcgattat cagcttacag gtgtccgttg gttaagtatg 18 00 

ttagatcatt atggctttgc agggatttta gctgatgata tggggcttgg aaaaacactt 18 60 

caaacgattt catttttaag tacaaaacta actagagatt ctcgtgtgct tattttatca 1920 

ccatcaagtt taatttataa ttggcaagat gaattccaca aatttgcgcc agacgtagat 1980 

gtagcagttg cctatggttc taagattcgt agagatgaaa ttattgctga acgtcatcaa 2 040 

gtaatcatta caagttattc atcatttagg caggattttg agacttattc tgaaggaaat 2100 

tatgattatt taattttgga tgaagcacag gttatgaaaa atgctcaaac taaaatagcc 2160 

catagcttga gatcttttga ggtcaaaaac tgttttgcat tatcgggaac acctattgag 222 0 

aataaactgt tagaaatttg gtccattttc cagatcattt tacctggttt attacctggt 22 8 0 

aagaaagagt ttttgaagtt gaaccctaaa caagttgctc gttatatcaa accatttgtg 234 0 

atgagacgtc gaaaagaaga agttctccca gaattaccgg atttgattga aatgaactat 24 00 

ccaaatgaaa tgacagacag tcaaaaagtt atctatttgg cgcaattacg ccaaattcaa 2460 

gaaagtatcc aacattctag tgacgctgac ttaaatcgtc gtaaaattga aattttatct 2520 

ggcatcacac gattacgtca aatttgcgac actccacgtc tttttatgga ttatgatggt 2580 

gaaagcggta agttagaaag cctacatcaa cttttaacac aaattaagga aaatggacat 2 640 

cgtgctctga ttttttctca gttcaggggt atgttagata tcgcagaacg tgagatggta 27 00 

gctatggggc ttacaactta taaaattact ggttctacac cagcaaatga acgtcatgaa 2760 

atgaccagag catttaatgc aggatcaaaa gatgctttct tgatttcttt aaaagctggt 2 82 0 

ggcgttggtc tcaaccttac aggagctgat accgttgttc tgattgactt gtggtggaat 2 880 

ccagctgttg aaatgcaggc gattagtcgc gcacataggt tggggcaaaa agagaatgtc 2 940 

gaggtttatc gcttgattac tagaggcaca atcgaagaaa aaattctaga aatgcaagaa 3 0 00 

actaaaaaac atttggttac aacagttctg gatggaaatg aaacccatgc tagcatgagt 3 060 

gtagatgata ttcgagagat acttggagtt tctaaataa 3 099 

SeqID 122 

gtgttactta cagaaatcaa aaaat caeca gaggggcttt attttgataa aaaaatagat 60 

atcaaagagt cgcttatgga gegtcattea gagataatgg atatttctga tattcaagtt 120 

tcaggtcatg tggtgtatga agatggtctc tatctacttg actataatat ggcttacgat 180 

attacattac cgfcctagtcg ttctatgaaa ccagttgttt tatctgaaaa gcaaactatt 24 0 

aatgaagttt ttattgaagc tgaaaatgtt agcactaaaa aagagctcgt tgatcaggat 3 00 

ttggtcttaa tattagagga agatgatatt aatttagaag aaagcgttat cgataatatc 3 60 

ttgttaaata tacctcttcg tgttctggca gctgatgagg ttggggtaga ageggattta 420 
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tcaggtaaaa attggtcgtt aatgactgaa aagcagtatg aagaaaaaca agctaaagaa 480 

aaagaaaaat ctaatccttt cgcagcctta gaaggtatgt ttgatagtga ttag 534 

SeqID 123 

atggaagtaa aaactgcaat agaatggatg cacactttta accaaaaaat tcaatccaac 60 

aaggactatt tatccgaact agatactcca ataggtgatg gcgaccatgg tggcaatatg 120 

gcgcgtggta tgactgctgt tattgaaaac cttgataaca atgaatttag tagtgctgcc 18 0 

gatgttttca aaaccgtatc catgcaactt ttaagcaaag taggcggagc atctggacca 240 

ctctatggtt ctgcatttat gggaattaca aaagctgaac agagtgaaag taccatttca 3 00 

gaggcacttg gcgcaggact tgaaatgatt caaaaacgtg gtaaagctga acttaatgaa 3 60 

aaaacaatgg ttgatgtttg gcacggtgtt attgaagcga ttgaaaaaaa tgaattgact 420 

gaagaccgta ttgattcctt agtagatgct acaaaaggga tgaaagcaac aaaaggacgc 48 0 

gcttcttatg ttggagaacg ctcattagga cacattgatc cagggtcatt ttcttcagga 540 

ttattattta aagcattact agaagttgga ggtgtctaa 579 

SeqID 124 

atgcaagata aattaatgat aagaggagca agagctcaca atttaaaaaa tataagtgtc 60 

gacatcccac gagacaagtt agttgtcgtt acgggactat caggatcagg taagtctagt 120 

ttggcctttg acactatcta tgcagaagga caacgtcgtt acgttgaaag tctttctgct 180 

tacgcgcgcc aatttttggg aaatatggaa aagcctgatg tagattctat tgatggcctt 240 

agtccagcta tttctattga tcagaaaaca acaagtaaga atcctcgctc tactgttggg 3 00 

actgtaacag aaattaacga ttatttacgt ttgctttatg ctcgtgttgg aaccccatat 3 60 

tgtatcaatg gtcacggagc catcacagca tcttctgtag agcaaattgt ggataaggta 42 0 

ctagcgcttc cagagcgaac aaagatgcaa attttagccc caattatacg tcgaaaaaaa 4 80 

ggacaacaca aatcaacatt tgagaaaata caaaaagatg gttacgttcg tgttcgtatt 540 

gatggtgata ttcacgatgt tactgaggtg cctgaattat caaaaagtaa gatgcacaat 6 00 

attgatattg tagtagatcg attaattaat aaagagggca ttcgttcccg cttatttgat 660 

tccgttgaag cggctctacg tttgtcggat ggttatgtcg ttattgatac aatggatggc 720 

aatgagttat tgttttcaga gcattattct tgtcctgaat gtggttttac tgttcctgaa 78 0 

ttagagccac gccttttttc ttttaatgct ccctttggtt cttgtacgac ctgtgatggc 840 

ttaggaatta agttagaagt ggatatagat ttggttattc cagataaaag caagaccctt 9 00 

agagaagggg ccttagtt cc ttggaatcca atttcatcaa attatfcatcc aacgatgcta 9 60 

gagcaagcta tgactcaatt tgatgttgat atggatacgc catttgaaaa attaagcaag 1020 

gcggagcagg acctagccct atatggttca ggcgaacgag agttccattt ccattatatt 10 80 

aacgattttg gaggcgagag aaatattgat ttgccttttg aaggggttgt caataatatt 114 0 

aatcgtcgtt accatgaaac taatagcgat tacacacgca atgttatgcg agaatatatg 12 00 

aacgagctaa aatgtaacac ttgtcatggc tatcgtctca atgatcaagc actttgcgta 1260 

agagttgggg gagaagaagg tcttaatatt gggcaagtct cagatttatc gatagcagac 13 20 

catttagaac ttttagaaac gttgcgttta tcttcaaatg aacaattaat tgctcgacca 13 80 

attatcaaag aaattcatga ccgcctaagt ttcttaaata atgttgggct taattattta 144 0 

aatctgtcac gttctgcagg aacgctttcc gggggcgaga gtcaacgtat tcgtttggct 15 00 

actcaaattg ggtctaattt atcaggtgtc ctttatgtac ttgatgaacc gtccattgga 15 60 

cttcatcagc gagacaatga ccgtttgatt gatagtctta agaaaatgcg tgatttgggt 1620 

aatacgctta ttgttgttga acatgatgaa gatacaatga tggcagctga ttggttaatc 168 0 

gatgtgggac ctggcgctgg tgcgtttggt ggtgagattg tggcctctgg cacaccgaag 174 0 

caagtagcta aaaatactaa atctattaca ggacaatatt tgtcaggtaa gaaagtgafct 1800 

cctgtgccat ctgaacgaag agtaggcaat ggtcgctttt tagaaatcaa aggtgcggca 1860 

gaaaacaacc tacaaaattt agatgtcaag ttccctctcg gtaaatttat tgccgtaact 192 0 

ggtgtctcag gttctggaaa atcaacctta attaatagta ttttgaaaaa agctgttgcg 198 0 

cagaagctta accgtaactc agataaacca ggtaaatatg tttctttaga aggcattgaa 2 040 

tatgttgatc gattgattga tattgaccaa agtccaattg gtcggacccc acgttctaat 2100 

cccgcaacct atacaggtgt ttttgatgat attcgtgatc tttttgctca aactaatgag 2160 

gcaaaaattc gaggttataa aaaaggacgc ttctcattta acgtcaaagg tggacgttgt 222 0 

gagtcttgtt ctggtgatgg tatcatcaaa attgagatgc attttttacc agatgtctac 22 8 0 

gttccttgtg aagtatgcca cggcacgcgc tataatagtg agacfcctgga agtacattac 2340 

aaagaaaaaa atattgcaca aattcttgat atgacagtga atgatgcagt tacatttttt 24 00 

gcagctattc ctaaaattgc tcgtaagtta cagactatta aggatgttgg tttgggatat 2460 

gtcactttgg ggcaaccagc tacaacctta tcaggaggag aagctcagcg aatgaagttg 2 52 0 

gcaagtgaat tacataagcg ttcgacaggt aagagtcttt acatattaga cgaaccaaca 2580 

acggggcttc atgcagatga tattgctcgc ttacttaagg ttttggatcg ttttgtagat 2 640 

gatggtaata cagtgcttgt tattgagcat aaccttgatg ttattaaaac tgctgaccac 270 0 

attattgatc tcggccctga aggaggtatt ggtggcggac aaattgtagc tatcggaaca 2 760 

ccagaagaag ttgctgaaaa tcctaaatcc tatacaggat attatttaaa agagaagtta 282 0 

gcaagatag 2 82 9 
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SeqID 125 

atgtggcctg aagatagaat cgcttctttt cgtcgcacac ttttagggtg gtatgaccaa 60 

gaaaaacgtg atctcccttg gcgaagaact actaatccat actatatttg ggtttctgaa 120 

atcatgttgc aacaaacaca agttaataca gtaatccctt attataaacg ttttctagag 180 

tggtttcctc aaataaaaga tttggcagac gctcctgaag aacaattact aaaggcatgg 240 

gagggactag gttattattc tcgggtgcgt aatatgcaaa aggctgcaca gcaagtgatg 3 00 

gttgattttg gtggcatatt tcctcacact tatgatgata ttgcttcatt aaaagggatt 3 60 

ggtccttata ctgcaggtgc tatcgccagc atctcattca acttaccaga gcctgccgtt 42 0 

gatggcaatg tgatgcgtgt catggctcgt ttgtttgaag ttaattatga catcggtgac 48 0 

cctaaaaatc gtaaaatttt tcaagctatt atggaaatac taattgaccc tgacagaccc 54 0 

ggtgatttta accaagcact tatggatctg ggtacagaca tcgaatccgc taagacacct 60 0 

agacctgatg aaagccctat tcgtttcttt aatgctgctt atttgaatgg tacttatagt 66 0 

aaatatccca taaaaaatcc taagaaaaag cctaaaccaa tgcgaattca ggcttttgtc 72 0 

attcgtaacc aaaatggcca atatctactg gagaaaaata ctaaagggag acttctaggg 78 0 

ggtttttggt catttcctat tatcgaaacg tcacctcttt ctcaacaatt agatttattt 840 

gatgacaatc aatccaaccc cataatctgg caaacccaaa atgaaacttt ccaaagagaa 900 

tatcaattaa agccccaatg gactgataat cactttccca atattaagca cacttttagt 96 0 

caccagaagt ggacaataga actcatagag ggggtggtca aagcaacaga tttacctaat 102 0 

gctcctcacc taaagtgggc agcaattgag gacttttctc tgtatccctt tgcaacgcca 10 8 0 

caaaagaaaa tgctggagac ctacttaaaa cagaaaaatg cttaa 112 5 

SeqID 126 

ttgatcgaaa tcacttggac tgtgaagtat atcacagaat ttatcgcaac cgcttttctt 6 0 

atcatcttag gaaatggggc tgtagctaac gttgacttaa aaggcacaaa aggaaataat 12 0 

tcaggttgga ttattattgc aattggttat ggtttaggtg ttatgatgcc agcgcttatg 18 0 

tttggtaatg tctctggaaa ccatattaac ccagctttca cacttggatt ggcattttca 240 

ggtcttttcc cttgggcaca cgttggacaa tacattcttg cacaaatttt gggagcaatg 3 00 

tttggacagt tagttgtcgt tatggtttat caaccttact ttgttaagac agaaaatcca 3 60 

aatcatgttc ttggatcatt ttcaactatt tctgcattag atgatggaca gaaatcaagt 420 

cgtaaagcgg cttacattaa tggtttctta aatgaattcg ttggttcttt tgttcttttc 48 0 

tttggagctt tggctttaac gaagaactat ttcggtgtag aacttgttgg taaacttgtt 540 

caagctggtt atgaccaaac gacagcagcg acaaggattt caccatacgt aactggttca 600 

cttgctgtag ctcacttagg tattggtttc ttggtaatga ctctagttgc ttctttaggt 660 

ggaccgactg gacctgcatt gaaccctgcg cgtgaccttg gtccacgtat tgttcaccgc 720 

ttattaccaa aacaaattct tggacaagct aaagaagatt caaaatggtg gtatgcatgg 780 

gtgccagttt tagcaccaat cgtagcaagt atactcgctg tagccttgtt caaactatta 840 



tatctttaa 



849 



SeqID 12 7 

atgtcatctt attggaataa ctatcctgaa cttaaaaaaa atattgatga aaccaatcaa 60 

ctaattcaag aaagaataca ggtcagaaat aaagatattg aagcggcgct aagccaactc 120 

acagctgcgg gaggaaaaca gctcagacca gcattctttt accttttttc tcaacttggt 18 0 

aataaggaga atcaagatac tcagcaacta aagaaaatcg ctgcttcttt agaaatcctt 240 

cacgttgcta cattaatcca tgatgatgtc attgatgact caccactaag acgtggaaat 3 00 

atgaccattc aaagcaagtt tggcaaagac atcgcagttt atactgggga tttacttttc 3 60 

acagtctttt tcgafccttat tttagaatct atggctgata caccatttat gaggattaat 42 0 

gcaaaatcta tgcgtaaaat tctcatggga gaattggacc agatgcacct tcgttacaat 48 0 

caacaacaag gtatccatca ctatttacgt gcgatttcag gtaagacagc cgaactcttt 54 0 

aaattagcta gcaaagaagg agcttacttt ggtggtgcag agaaggaggt tgttcgtcta 60 0 

gcaggccata tcggctttaa cattggtatg acattccaaa ttttggatga tatcctggat 660 

tatactgcag ataaaaaaac atttaataag cctgtcttag aggatttaac acaaggcgtt 72 0 

tacagccttc ctctacttct tgccattgaa gaaaatcctg atattttcaa acctatttta 78 0 

gataaaaaaa cagatatggc tactgaagac atggaaaaaa ttgcttatct cgtcgtttcc 84 0 

catagaggtg ttgacaaagc tcgccatcta gctcgtaaat ttactgagaa agctattagt 90 0 

gacataaata agctacccca gaactctgca aaaaaacagt tgctacaatt aactaattac 960 

cttttaaaac gcaaaattta a 98 1 

SeqID 128 

atgtttaaaa tccctttgtt taaagaatta aagacagatc aatgggtaaa accatttttc 60 

aaacaatata aagtttcctt agtaattgct cttttccttg gttttatgac tttcttttct 12 0 

gccagtgctc ttatgtttaa ttcgggctat ctgattagta agtcagcttc actaccgagt 18 0 

aatattcttc tagtatatgt tccaattgta ctaacacgtg cttttggtat tggacgtcct 24 0 

gtttttcgtt atatagaacg cctaacgagt cacaattggg tattacgaat gacttcacaa 3 00 

ctgcgcttga agctttacca ttccttagaa agcaatgcta tctttatgaa gcgtgatttt 3 60 

cgtttaggag atgttatggg gcttttagcg gaggatatta attatttgca aaatttgtat 42 0 
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cttaggacaa tatttccaac gattatagcg tggttattat atagctttat tattattgca 48 0 

acaggcttct tttctctatg gtttgcttta atgatgttgc tttatttagc tattatgatt 54 0 

tttctttttc cactttggtc gattttagct aatggtgctc gtcagactcg tgaaaaagaa 600 

ttaaaaaatc atctttatac tgatttgacg gataatgtct taggtatttc agactggatt 660 

tttagtcaac gtggtcaaga gtacgtcgcc ttgcatgaac gttcggaaag tgaactaatg 72 0 

gctattcaga aaaaaattcg ttcctttaat aatcgtagag cfccttattgt ggaattagtt 780 

tttggctttc tagctatttt ggttattatt tgggctagta atcagtttat tggtcacaga 84 0 

ggtggcgaag ctaactggat tgcagcafctt gttctaacag tatttccctt gtcagaggca 900 

tttgctggtt tgtctgctgc tgctcaagaa acaaataaat attctgattc aattcatcgt 960 

cttaatgaat tatcagaaac gtattttgaa acaactcaaa atcaattgcc taataagcca 102 0 

tatgattttt ccgtcaagaa tttaagtttc cagtataagc cacaagagaa atgggttttg 108 0 

catcatttag atttggatat taaagaagga gaaaaaattg ctattttagg gcggagtggt 114 0 

tctggcaaga gtaccttagc tagtctactt cgtggcgact tgaaagcaag tcaagggaag 12 0 0 

attaccctag gcggtgctga tgtttcaata gttggtgatt gcatttctaa ctatattgga 1260 

gtgatacaac aagcgcccta cctttttaat acaaccctgc taaataacat tcgcattgga 132 0 

aaccaagatg ctagtgaaga agatgtatgg aaagttcttg aacgagttgg attgaaagag 13 8 0 

atggtcactg atttatcaga tggtctctat acaatggttg atgaagctgg acttaggttt 144 0 

tcaggtggcg aaaggcatag aatfcgctttg gctagaatct tactaaaaga tgtcccaatt 1500 

gttatcttag atgagccaac tgttggactt gatcctatta ctgaacaagc cttactaagg 1560 

gtatttatga aagaattgga aggtaagacc ttagtttgga ttacacatca cttaaaaggt 162 0 

attgaacatg ctgatcgtat cttatttata gaaaatgggc aattagagtt ggaaggttct 168 0 

cctcaagaat tgtctcaaag tagtcaacgt taccgtcagc tgaaagccgc tgatgatggt 174 0 

gatctatga 174 9 

SeqID 129 

atgaaaaaac ttattacaga aaaaaaagtc aacaacgtct caactgttaa ctatttgaaa 60 

ctcggtcttg tctcagcaat gtttgctgga ggagcattcg tagctcttgg ttctacacaa 12 0 

ggtgtttcag cttcaacttt cacagcaccg caagcaacac atccaaaagc agaacgtcag 18 0 

ttaacagatt cagaaatcta tgaacgtgcc caaaaacaag tgttaccaaa atatatccaa 24 0 

ggttctctct caggtatcct taaccaacat tcaacacttt acaaacaaca aaacgcagca 3 00 

gtgacaccgc aagtatcaag ccccaaagca gaacgtcagt taacagattc agaaatctat 3 60 

gaacgtgccc aaaaacaagt attgccaaaa tatattcaag gttctctctc aggtatcctt 42 0 

aaccaacatt caactttgaa tgcttaa 447 

SeqID 13 0 

atggaggaaa acatgaacat aaaacaatta aagtctaaaa caatgcttgg aacagtggct 60 

ttggtgtccg ctttttcatt tgcttcaaca aatgcagatg cgaatacata taactatgca 12 0 

gtagatgtag actacctagc aagtgcggag gaaattgctc aagcacatcc agctagtaat 18 0 

acgtttccac taggtcaatg cacatggggt gtaaaggaga tggcaacatg ggcaggtaac 24 0 

tggtggggaa acggtggtga ttgggctgcg agcgccgcat cagctggtta tacagtagga 3 00 

acacaacctc gcgtgggttc tatcgtatgt tggactgatg gtagttatgg acatgtggct 3 60 

tatgttacag ctgtagatcc tgtcactaat aaaattcagg ttttagaatc taattatgca 42 0 

ggtcatcaat ggattgataa ttatcgtggt tggtttgatc cacaaaatac agcgacacca 48 0 

ggggtagtta gctacattta tccgaattaa 510 

SeqID 131 

atgaaaatta accaaatgaa aaaagacgag ctttttgagg ggttttacct tataaagaaa 60 

gcagaagttc gtaaaacgcg tgcaggtaaa gactttattg cctttacctt tcgagatgafc 12 0 

actggcgaaa tttcaggaaa tatgtgggat gcacaaacct ataatgtaga agagtttgta 18 0 

gccggtaaaa ttgttcatat gaaagggcgt cgtgaagttt ataatggaac tccccaagtc 24 0 

aaccaaatta cattaagaaa catcaaagat ggtgaaccta atgatcctag agattttaaa 3 00 

gaaaaacctc caatcaatgt tgacaacgtg agagagtaca tggaacaaat gctctttaaa 360 

attgaaaatg caacctggca gcgtgtggta agagcaattt atcgtaaata caataaggaa 420 

ttttttacct atccagccgc caaaacaaac caccacgcct ttgaatcagg attggcatat 48 0 

cacacggcaa cgatggttcg fcttggcagat agtattggag atatctat cc agaacttaat 54 0 

aaaagtttga tgtttgctgg tattatgcta catgatttag ccaaggtcat tgagttatcg 60 0 

ggtcctgata atacagaata tactattcga ggtaacctta tcggtcatat ttcacttatt 66 0 

gatgaggaat fcaacaaaaat actagcagag cttaatattg atgatacaaa agaagaggtt 72 0 

acagtgttac gccatgttat tctgagtcac catggacaat tagaatatgg aagcccggtt 78 0 

cgtcctagaa tcatggaggc agaaattatc catatgatcg ataacattga tgcaaatatg 84 0 

atgatgatga caacagcttt aaatcgtgtc aacgaaggtg aaatgacaaa tcgcattttt 900 

gccatggata accgttcttt ttataaaccg aacattaaafc aa 942 



SeqID 132 

atgttggttg acaagaagtg gcgttttgaa gatagtgcaa gttatttcgc ttgtcctaaa 



60 
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tgtcaaaatc ccttgataaa agaaagtaac agcctcaaat gctctgataa tcattgtttt 120 

gatttatcca aatttggata tgttaafcctt cttggaggaa agaagataga tgaacattat 180 

gataaaaaat catttgaaaa tcggcagctt gtcttagaaa acgggtatta taatcatatt 240 

ttagaagcta tttcaaaggt tttggaaaat aattctcagt ttcattcggt tttagatatt 3 00 

ggttgtggag agggctttta ttctcgtcaa ttagttaaca agtatgagaa gactttcttg 3 60 

gcttttgafca tttctaaaga ttcgattcaa ttagctgcta aaagtgatca atctaggctt 420 

gtgaaatggt ttgtttctga tttagctaac ctfccctattc aagactcatc aattgatatc 480 

attttagaca ttttttctcc tgcaaattat aaagaattta ggcgggtact gagtgatgat 540 

ggtatccttg taaaggttgt tcctgtagct gaacatgtcc aagaattgcg agaaaaagct 600 

agtcaatact tgaaacagaa agattactct aatcaaaaaa ttctagatca ctttagagaa 660 

aattttgaaa ttatttctga gcagaaagtt gttcaaagtt ataactgttc tcaacaagaa 720 

cgtcaagctt ttattgatat gacaccgttg ttattcagcg tagataaaac gactattgat 78 0 

tgggcgagta tatcagaaat tacagtaggt gccttaatag taattggaaa aaagcgcagt 840 

gtatctaaat aa 852 

SeqID 133 

atgtgtctta tttgtcaacg tattgaaatg attgaacgaa atgagaatcc ttattttgtg 60 

aaagaatatg aaacgggtta tcttgttctg ggagaccatc aatattttca aggttattgc 12 0 

ctttttctct ctaaaaaaca cgtaacagag ttacatgaat tgcctagaga ttggcgaaat 18 0 

cagtatcttt ccgaaatggc agatgcttct gaaatagttg ctaaggcttt tagagcagat 24 0 

aagatgaata ttgagagttt gggaaatgga gacgctcatc tacattttca tctttttcca 3 00 

agaaaaacag gtgatttaag gaactatggg cataatggaa aaggtccagt ttggtggtat 3 60 

ccttttgaga agatgtatgc agatagtgtc agagcaactg gtgcagaaat agaaaaatta 42 0 

aaagagaaat tgttagatgt cttaggatag 45 0 

SeqID 134 

atgtttagtg atttacgcaa aaaatttgtc tttttgacca tgtcaatcct tattgtcgtt 60 

gttctctttc tctttgcagt gagtaatcgc tacaatcaat attgggatga atatgatgca 12 0 

tataggattg ttaaattagt tgccaaaaat gattatttag gaataccagg cgatgaaccg 180 

attgctttgg tgactataga taaccaaaag atggttaaga ttcaaagcaa taatactgat 240 

ttgacaaatg atgttatcga aaaatctagt ttgaagctcc ttgagcaagg aaagaagtca 3 00 

cgaaaatgga agtcttttat atattccata aaagaatata aagataaaac ctacaccata 360 

gctattatgg attfcggctag ttatgaagtc ccttatgcaa ggcgttttct aattcttgtt 420 

tttactattt ttggattttg tttattagca gctgtaagtc tttatctttc tcgatttatc 48 0 

gtagggcctg tagagacaga aatgactcgc gaaaaacagt tcgtttcaga tgcgagtcat 54 0 

gagctgaaga cccctattgc agctatacga gccaacgttc aggttctgga acagcaaatt 600 

ccaggaaatc gttatctgga tcatgttgtt tcggagacta agcgaatgga gtttttaatt 660 

gaagatttgc ttaatttgtc tagacttgat gaaaaacgct ccaaggtaaa tttcaaaaag 720 

ttaaacttat ctgtcttatg tcaagaagta cttctgacat atgaaagttt ggcatatgaa 78 0 

gaagaaaagt gtttaaatga tacgattgaa gatgatgtct ggattgttgg agaagagagt 840 

cagattaagc aaattttaat aattcttttg gataatgcta ttaggcattc tttgtcaaaa 900 

tcagaaatcc agtttagtct taaacaggct cgaagaaaag ctattctgac aatatcaaat 960 

ccaagtgcta tctattccaa agaggttatg gataatctct ttgagcgttt ttatcaagct 102 0 

aaagatgatc atgcagattc tctcagtttt ggtctaggct tatctattgc taaggcaatc 108 0 

gtggaacgcc ataaggggag aattagagca tatcaagaaa aagaccaatt acgtcttgaa 114 0 

gtccagttac caattgatgg attttggacc aataccatga ttaattaa 1188 

SeqID 13 5 

ttgattattc tcgataaaaa aagttatgac ctcctctttt acctattgaa attagaggaa 60 

cctgaaacag ttatggcaat tgccaacgca cttaatcagt ctaggagtaa agtgtattat 12 0 

cacttagaga agataaacga tgcactgcct agcgatgtgc ctcagattgt tagttatcca 18 0 

cgtgtaggaa tcttgctaac cgaaaaacaa aaagcagcct gtcgtctttt attagatgaa 24 0 

gtaactgatt atagttacgt catgaaaagt agtgagaggt tgcagctgtc tttagtatct 3 00 

atcgtggtag ctaaggaccg cataacgatt gataggttga tgcaactaaa cgatgtttct 3 60 

cgaaatacta tcttaaacga tttaaacgaa ttaagaagtg agttagcaga gaaagaatat 42 0 

aatttacagt tacaatcaac aaaatgtcgc ggttattttt tagatggtca cccattgtcc 480 

attatccagt acttatataa gctcttagat gatatctacc ataatggaag tagtagtttt 54 0 

atagaccttt ttaatcataa actgtctcaa gcttttggtg ccagcacata tttttctaaa 600 

gaggttcttg attattttca ccattatctc ttcatttctc aacgaagtct aggtaagaaa 66 0 

atcaacagtc aagatggtca gtttatgatt cagattttgc cttttatact aatggcttat 72 0 

cgtaagatgc gattaagtcc tgaagtacag acctctctta atagtgattt tagcttggtt 78 0 

tggcaacgta aggaatatga gattgctaaa gagttggctg atgagctgga agaaaatttt 84 0 

cagttatcac tggatgagat fcgaagtggga ctagtggcca tgcttatgct tagfcttccgc 900 

aaggaccgtg acaatcattt agagagccag gattatgatg atatgcgagc tactctaaqc 96 0 

agttttttga aagaattgga agaacgatat caccttcact ttgttcataa aaaggactta 102 0 
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ctaagacaac ttctaactca ctgcaaggca ctcttatatc gtaaacgtta tggtattttt 1080 

tctgttaatc ctttaacaga gcatattaaa gacaaatatg aagaactttt tgccataacc 1140 

tcgtcttctg taaagctttt agagaaagct tggcaaatca aattgaccga tgatgatgta 12 00 

gcatatctaa cgattcattt aggaggggaa cttcgtaata gtcaacaatc tcctaataaa 1260 

cttaagttag ttattgtatc tgatgaagga atagcgattc agaaacttct tttaaagcaa 13 20 

tgtcaacgct acttaacaaa cagtgatata gaagctgttt ttacaaccga acagtaccaa 13 8 0 

agtgtgagtg atcttatgca tgtagatatg gttgtctcta ctagtgacgc tttagaatct 1440 

cgttttccga tgttagtagt tcaccctgtt ttgacagatg atgatattat tcgcttgatt 1500 

cgcttttcaa aaaaaggtaa ctgtgcaaat agtaatcaat ttaccaatga acttgaaaaa 1560 

acgattgctc aatatgtcaa ggaagatagt gaacgctacg tgctgaaatc taagattgag 1620 

aaacttattc atcaagaatt gctccaagac gtccttcccc ttcaaagtac agtttgttaa 168 0 

SeqID 13 6 

atgaaatact ttttagatac tgccgatgtt tctgaaattc gtcgtcttaa tcgtttagga 6 0 

atcgttgatg gtgttactac caatccgact attatttcac gtgaggggcg tgactttaaa 12 0 

gaagttatta acgaaatttg ccaaatcgtt gatggcccag ttagtgcaga agtgacaggt 18 0 

ttaacttgtg atgaaatggt taccgaagct agagaaatcg ctaagtggtc accaaatgtt 24 0 

gttgttaaaa ttccaatgac tgaagaaggt ttagcagcag tttcacaatt atcaaaagaa 300 

ggtataaaga caaatgtgac attaatattt actgtggctc aaggtctttc ggcaatgaaa 360 

gctggtgcaa catttattag tccttttgtg ggtcgtcttg aagatattgg aacagatgct 420 

tatgctttga ttcgtgacct tcgtcacatt attgattttt atggtttcca atctgagatt 48 0 

attgctgcta gcattcgtgg attggcgcat gtggagggcg tcgctaaatg cggtgcgcac 54 0 

attgccacta ttccagataa aacatttgca tcactattta ctcacccgct tactgataaa 60 0 

gggattgaaa ctttcttaaa agattgggat agtttcaaga agaagtag 64 8 

SeqID 13 7 

atgagtcatt attccataaa attacaagaa gtattcagat tggcgcaatt tcaagcagca 60 

cgctatgaaa gtcactattt agaaagctgg catcttcttt tagcaatggt tttagttcat 12 0 

gattcagtag ctggtttgac atttgctgaa tatgagtctg aagttgcaat tgaagagtac 180 

gaagcagcaa caattttagc tttaggtaga gcaccgaaag aagagattac caattatcaa 24 0 

tttttagagc aatcaccagc tttgaaaaag atcttaaagt tagcagaaaa tattagtatc 300 

gtagtaggag ctgaagatgt gggtactgaa catgtcttgc ttgctatgtt ggttaataaa 36 0 

gat t tat tag ccacacgtat cttagagtta gttggcttta gaggtcaaga cgatggtgaa 42 0 

tcagttcgta tggttgattt acgtaaggct ctagaacgtc atgccggttt tacaaaagat 480 

gatatcaaag caatttatga attacgtaat cctaaaaaag caaagtcggg tgcttctttt 54 0 

tctgatatga tgaaaccacc gagtactgct ggtgatttag ctgatttcac acgtgattta 600 

tcacagatgg cggttgatgg ggaaattgaa cctgttattg ggcgtgataa agaaatttca 660 

cgtatggtcc aagtcttaag tcgtaaaaca aagaataatc ctgtcttagt tggagatgcg 72 0 

ggtgtcggta aaacagcttt ggcgtacggt ctagcacaac gtattgctaa tggaaatatt 78 0 

ccttatgagc ttagagatat gcgtgttcta gagttagata tgatgagcgt tgttgctgga 84 0 

actcgttttc gtggtgattt tgaggaacgt atgaatcaaa tcattgcaga tattgaagaa 90 0 

gacggtcata ttattctttt cattgatgaa ctgcatacta ttatgggttc tggctctggt 960 

attgatagca cacttgatgc tgctaatatt ttaaaaccag ctttggcccg aggcacttta 102 0 

cgcactgttg gagcaacaac tcaagaggaa tatcaaaagc atattgaaaa agatgctgct 108 0 

ttatctcgtc gctttgcaaa agttttagta gaagaaccaa atcttgaaga cgcatatgaa 114 0 

attctacttg gattgaagcc agcgtatgaa gcctttcata atgttactat ttcagatgaa 1200 

gcagttatga ctgctgtaaa agtggctcat cgttatttga caagtaagaa tttaccagac 126 0 

tcagctatag acttacttga tgaagcttct gccactgtgc aaatgatgat taagaaaaat 132 0 

gctccttcgc ttttaacaga agttgaccaa gctattttag atgatgatat gaaatctgct 138 0 

tctaaagctc taaaagcaag ttataagggt aaaaagcgta agcctattgc cgtaacagaa 144 0 

gatcatatta tggcaactct cagtcgtcta tctggtattc ctgttgaaaa attgacacaa 15 00 

gctgatagta aaaaatacct caatttagaa aaagaattgc ataaacgtgt tatcggtcaa 1560 

gatgatgcag ttactgccat tagtcgtgct atccgtcgca atcaatctgg cattcgtact 162 0 

ggtaagcgtc ctattggttc ctttatgttc ttgggaccaa ctggtgttgg taagacagag 168 0 

ttagcaaaag ccttggcaga agtgttgttt gatgatgaat cagccctaat tcgttttgat 174 0 

atgtcggaat atatggaaaa attcgcagct tctcacctga atggggcacc tccaggttat 18 0 0 

gttggttatg atgaaggagg agagttgact gaaaaagtac gcaataaacc atattcagtt 186 0 

ttactttttg atgaagttga aaaagctcat ccagatattt ttaatgtatt attacaagtg 192 0 

cttgatgatg gcgttttaac agatagtcga ggacgaaaag ttgatttttc aaatactata 198 0 

atcattatga cttctaattt aggagcaact gcattacgtg acgataaaac agttggtttt 204 0 

ggagctaagg atatcagcca tgattatact gctatgcaaa aacgtattat ggaggagctt 210 0 

aaaaaagctt atcgaccaga gtttatcaat cgtattgatg agaaagttgt attccatagt 2160 

ctcagtcaag ataatatgcg tgaagttgtg aaaattatgg ttaaaccttt aattttagcg 222 0 

cttaaagata aaggtatgga tctgaaattc cagccttcag ccttgaaaca cttggcagag 228 0 

gatggctatg atatagaaat gggtgctcgt ccactgcgta gaactattca aacgcaagta 234 0 
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gaggatcatt tatcagaact actattggca aatcaagtga aagaggggca agtaattaag 2400 

attggcgtat ccaaaggcaa attgaaattt gatattgcaa aatcataa 2448 

SeqID 13 8 

atggtcttag ataaagaaat caaggctcaa ttagcgcaat atttagattt gctagaatct 60 

gatattgtat tacaagctga tttaggagat aacgataatt ctcaaaaagt aaaggatttt 12 0 

ctagatgaaa ttgtcgctat gtctgatcgc atcagccttg aatcaacaca tttaaaacgt 180 

caacctagtt ttggaattgc aaaaaaaggg catgaaagtc gcgttatttt ttcaggcttg 240 

ccaatggggc atgaattcac atctttcatt ttagctttgt tgcaagtctc tggtcgcgca 3 00 

ccaaaagttg atgaagatat tatcaaacga attaagggga ttgagaaaac tatcaatcta 360 

gaaacttatg tttcattaac ttgccataat tgtccagatg tcgttcaagc ctttaatata 420 

atggctgttt tgaatccgaa tatcactcat actatgattg aaggaggcat gtafccaagac 480 

gaagtcaaat caaaaggtat catgtctgtt cctactgttt ataaggacca agaagaattt 540 

acttctggcc gtgctacaat tgaacagctt ttagaacaac tagatggacc tcttgatgct 600 

gaagcatttg ctgacaaagg tgtctatgat gttctagtta ttggtggtgg tcctgcaggt 660 

aacagtgctg caatttatgc cgcccgtaaa gggttaaaaa caggtatttt agcagaaact 720 

tttggtggac aagttatcga aacagttggt attgagaaca tgattggtac actgtacaca 780 

gaaggcccta aattaatggc tcaaattgaa gagcatacca aatcttacga tattgatatt 840 

atcaagagtc aattagcaac tggcattgag aaaaaagaat tagttgaggt aacactggct 900 

aacggagcta ttttacaagc taaaactgct atcctagctc tgggtgctaa gtggcgaaat 960 

atcaacgttc ctggcgagga agaattccgc aataaaggtg ttacttactg tccacattgt 102 0 

gatggccctc ttttcgaagg aaaagatgtt gctgttatag gtggtggtaa ttcaggtatg 1080 

gaagctgctt tagatctagc cggtgtcact aaacatgtta cagtcctaga attcttacct 114 0 

gagctcaaag cagaccaagt actgcaagaa cgagcagcta aaactgacaa tttaactatt 120 0 

cttaagaatg ttgctactaa agatatcgtt ggtgaagatc atgtcactgg tttaaactat 1260 

accgatcgtg atacaaacga agaaaaacac attgatttag aaggtgtttt tgtacaaatt 132 0 

ggtcttgtac caagtacatc ttggctcaag gatagcggca ttgaactcaa tgaacgtcaa 13 8 0 

gaaattgtcg ttgataaatt tggctcaaca aatatccctg gtatttttgc tgcaggtgac 144 0 

tgtacagatg ctgcctacaa gcaaatcatt atttccatgg ggtctggtgc cacagctgcc 150 0 

attggcgcat ttgattattt aatcagacaa taa 1533 

SeqID 13 9 

atgataaaat atcaagatga tttttaccaa gcggttaatg gggagtgggc taagacagct 60 

gttattcctg atgacaaacc aaggacaggt ggtttttctg atcttgctga tgacattgaa 120 

gcattaatgc tgtcaacgac agataagtgg ctggctgatg aaaataaacc aagtgatacg 18 0 

attttaaatc attttattgc ttttcataaa atgacagctg attatcaaaa acgtgaagag 24 0 

gtaggagtat caccagtatt accattaatt gaagaatata aaggcctaca atctttctct 300 

gagtttgcat caaaagttgc agaatatgaa ctagagggta aaccaaatga atttccattt 360 

ggtgtggctc ctgactttat gaatgctcaa ttgaatgtac tttgggctga agcgcctggg 42 0 

attattttac cagatacaac ttattatagt gaagataatg aaaaaggaaa ggaactcctt 480 

gctttttggc gtaaaagtca agaagaccta ctgcctttgt ttggcctttc agaacaggaa 54 0 

attaaagata tcttggataa ggttttagca ttggatgcaa aattggcaca atatgtctta 60 0 

tctcgcgaag agagttcgga atatgttaaa ctttaccatc cttataactg ggaagatttc 66 0 

acaaaattag cacctgaact tccattagat gctattttcc aaaagatatt aggacaaaaa 72 0 

ccagataaag ttattgttcc tgaagaacgt ttctggacag aatttgcatc tgactattac 78 0 

tcagaatcta attgggaatt attaaaggca gatcttattt tatcagctgc gaatgcctat 84 0 

aatgcttact taactgatga tattcgtatc aaatcgggag tttacagtcg agctttatca 90 0 

ggcacaccac aagcaatgga taagaaaaaa gcagcttact accttgcttc aggaccttac 96 0 

aatcaagctc taggtttgtg gtatgcagga gaaaaattct ctccagaagc taaagctgat 1020 

gttgagcata agattgcaac gatgattgat gtctataagt cacgcttaga aaaggcggat 108 0 

tggcttgcac agtcaacgcg agaaaaagct attatgaaac ttaatgttat cacaccacat 114 0 

attggatacc ctgagaaact accagagact tatactaaaa aaatcattga tcctaaacta 1200 

tcgcttgtcg aaaatgctac taatttagac aaaatttcaa tagcctatgg ttggagtaaa 126 0 

tggaataaac cagttgatag aagtgaatgg catatgcctg cccacatggt caatgcgtat 132 0 

tatgatccac aacaaaacca aattgtcttc ccagcagcca ttttacaaga accattfctat 13 8 0 

gctttagagc aaagttcatc agcaaattat gggggaattg gtgcagtgat tgcacatgaa 144 0 

atatcacacg cttttgacac taatggagct tcatttgatg aacatggaag tttgaataat 150 0 

tggtggactg atgaagattt tgaagctttc aaaaaactta cagataaagt agttgagcag 156 0 

tttgatggac tggaatctta tggcgcaaaa gtcaatggta aactaacagt atcagaaaat 162 0 

gttgcagatt taggtggagt tgcttgtgct ttagaagctg cgcagcgtga gtcagatttt 168 0 

tcagcgaggg atttctttat taactttgcg acaatttggc gtatgaaggc acgtgatgag 174 0 

tatatgcaaa tgttagctag tgtagatgtt cacgccccag cacaatggcg aactaatatt 1800 

acagtaacga attttgaaga attccataaa gagtttgatg ttaaagatgg tgataacatg 186 0 

tggcgtccag ttgaaaaacg tgttattatt tggtaa 1896 
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SeqID 14 0 

atgtctaaga ttcttttttt cggtgaacct ctcattcgca tcactccaaa agaaaatgac 60 

tattttgcag atagtatctc aactaaacta ttctatggtg gctctgaagt taataccgca 120 

agagctctac aaggctttgg gcaagatact aaactcctct cagccttacc caataatccc 180 

ataggtaata gttttcttca atttttaaaa gcacagggta ttgatactca tfcctattcag 24 0 

tgggttggag aacgtgttgg gttatatttt ttagaagatt cctttgcttg cagaaaaggt 3 00 

gaggtggtct acgaccgcga tcatagtagt ttacatgact ttaggataaa tcaaattgat 3 60 

ttcgatcaat tatttgaagg agtttctctc tttcatttta gtggtattac actatcattg 42 0 

gatgagtcta tccaagagat aacactgctt cttttaaaag aagctaaaaa aagagagata 48 0 

acaatctcct tagatctcaa ttttcgcagt aaactaattt ctectaaaaa tgccaaaatc 54 0 

cttttttctc agttcgctac ctttgcggat atttgttttg gtattgaacc tcttatggtt 60 0 

gactcacagg acactacctt cttcaatcgc gatgaagcta caatagagga cgtcaaagaa 660 

cgaatgattt cgcttataaa tcactttgac tttcaagtca ttttccatac taaacgtcta 720 

caggafcgaat ggggacgaaa ccactaccaa gcttacatag caaatcgtaa acaagagttt 780 

gtcacttcaa aagaaafccac cacagctgta aaccagagaa ttggaagcgg agatgctttt 840 

gtggcgggag cgctatatca actacttcag cactcagaca gcaagacagt cattgatttc 900 

gctgtcgcta gtgcatcttt aaaatgtgct ttggaagggg acaatatgtt tgagaccgtt 960 

accgcagtta acaaagtact aaatcaatcg aaagacatta ttcgttag 10 08 

SeqID 141 

atgaaaagtc gaaaaaaaga taaattggta ttgaggttaa caacaacact attggttttt 60 

ggtttgggtg gggtttggtt ttataattat aaaaatgata atgtcgaacc gacagtcact 12 0 

agtgcatcgg atcaaacgac gacttttatt caaacgattt ctccaacagc tattgaaatt 180 

tctaagacct atgatttgta tgcgtcagtc ttattagcac aagctatttt ggaatcatcc 240 

agtggacaat cagatttgtc taaggctcct aattataacc tctttggcat caaaggagaa 3 00 

tataaaggta aatctgtcca aatgcctact ttagaagatg atgggaaagg caatatgact 3 60 

caaatccaag ctccttttcg cgcctatcca aatfcattctg cttcactata tgattatgct 420 

gagttagtat ctagtcaaaa gtatgcatct gtttggaaat caaatacctc ttcttataag 4 80 

gatgctactg cagctctaac aggtctttat gcgacagata ctgcttatgc tagtaaatta 540 

aaccaaatta ttgaaaccta cagtctagat gcttatgata aatag 585 

SeqID 142 

atggcagacg tgaaggtagt gaataatgag gattctagag gtcaaaagca agacttaaag 60 

gctaaactat ttcatattaa gatagggtca gttcccctac cagtatatgt ttgtttagca 12 0 

ttattgattc ttctagcagg ctttttacaa aaattgccag tcaatatgct aggaggattt 18 0 

gcagttatct taacaatggg gtggttctta gggactatcg gagctagcat tcctggtttt 24 0 

aaaaacttcg gtggcccagc tattttatct ttattagtac catctatttt ggtgtttttc 3 00 

aacctcatta ataaaaatgt tttagaatca acaaatatgt tgatgaagca agctaacttt 3 60 

ctttattttt atattgcttg tttagtgtcc ggtagtattt tagggatgaa tcggaaaatg 420 

ttgattcagg gattgctaag aatgattttc cccatgttat taggaatggt ttgtgcgatg 48 0 

atggtaggga catttgtcgg tgttatttta ggcttagagt ggcgacacac tttgttttat 54 0 

atcgtaacac ccgttttagc tggtggtatt ggtgaaggta ttttaccatt atcgttaggc 600 

tatagtfccaa ttaccggtgt agctagtgaa caactagttg ctcaactcat cccagccact 66 0 

attattggta atttctttgc cattttatgt actgcactat tgaatcgttt gggagaaaag 72 0 

aaaccacact tgtctggtca agggcaatta gtaaggttaa ataaaggaga ggacatgtca 78 0 

gatattattg ctgatcattc tggcccaatt gacgttaaga aaatgggtgg aggtgtttta 84 0 

acagcatgta gtctctttat ttttggacat ttgttgcagc aattaactgg atttcctggt 900 

cccgtattaa tgattgttgc agcagctatt ttgaaatata ttaatgttat tcctagagaa 96 0 

acacaaaatg gagctaagca actttataaa tttatttctg gtaattttac atttcctcta 102 0 

atggcagggc taggattgct ttatatcccg ttaaaagatg ttgtggcaac gcttagcata 108 0 

caatatttca tagttgttat tagtgttgta tttacagtfca tttctgttgg attctttgta 114 0 

tcgcgattcc ttaatatgaa tcctgttgaa gcaggtatta tttcagcttg tcaaagtggt 12 0 0 

at 9"ggaggaa caggagatgt tgccatttta agtacagcag accgaatgaa cttgatgcca 12 6 0 

tttgctcaag ttgctacgcg tttaggagga gctattactg ttatcacaat gacagccatt 1320 

ttacgcatgt tattctaa 133 8 

SeqID 143 

agaggcaaaa aagctggtaa atacacaaca tctgatggtt acatttttga tgccaaagat 60 

attaaaaaag atacaggtac aggttatgtc attccacata tgacacatga gcattgggta 12 0 

ccaaagaaag atttatcaga gtcggaatta aaagcagctc aagaatttct ttcaggaaaa 18 0 

tctgaagcaa atcaagacaa accaaaaaca ggtaaaacag ctcaagaaat ctatgaggca 24 0 

attgaaccaa aagcaattgt taaacctgaa gatttattat ttggaattgc acaagcgaca 3 00 

gactataaga atggtacatt tgtaattcct cataaagatc attaccatta tgtggaatta 360 

aaatggtttg atgaagaaaa agatctttta gctgattcag ataagacata ttctttagaa 42 0 

gactatttag ctacggctaa atattacatg atgcacccag aaaaacgtcc taaagttgaa 48 0 
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ggatggggta aagatgctga aatttataag gaaaaggact ctaataaagc agataaacca 540 

agtcctgcac caactgataa taaatcaaca tcaaattcta gtgacaaaaa cttaagtgct 600 

gcagaagtat tcaaacaagc aaaaccagaa aaaattgtac cgcttgataa aattgctgct 660 

cacatggcat atgcagttgg atttgaagat gatcaattga ttgttcctca tcatgatcat 720 

tatcataatg fctcctatggc atggtttgac aagggtggtt tatggaaagc accagaaggc 780 

tatacattac aacaactctt ctcaacaatt aaatactaca tggaacatcc taatgaatta 840 

ccaaaagaaa aaggttgggg acacgacagt gatcataaca aaggctcaaa taaagacaat 900 

aaagccaaaa attatgctcc agatgaagaa cctgaagatt cagggaaagt aactcacaac 960 

tatggttttt atgatgttaa taaaggttca gacgaagaag aaccagaaaa acaagaagat 1020 

gaatcagagc tagatgaata tgaactagga atggcacaaa atgctaagaa atatggtatg 1080 

gatagacaat cttttgaaaa gcaactcatc caattatcaa ataaatatag tgtaagtttt 1140 

gaaagcttta attatattaa tggtagtcaa gttcaagtta ctaaaaaaga tggctctaaa 1200 

gtgcttgttg atattaaaac actaacagaa gtgaaataa 123 9 

SeqID 144 

atgaatcgta aaaaaacagt tattattagt gctttatcag tagctctatt tggtactggt 60 

gtaggtgctt accaattagg tagctataat gcccaaaaat cagacaatag tgtttcttat 120 

gtgaaaacag ataaatctga tagcaaagca caagcaacag ctgtaaataa aacgcccgac 180 

caaatcagta aagaagaagg catatctgca gaacaaatcg ttgttaaaat tactgacgac 240 

ggttatgtga cttctcatgg agaccattac cattattaca atgggaaagt gccatatgat 3 00 

gccatcatta gtgaagaatt gattatgaaa gaccctagct atgtctttaa taaagctgat 3 60 

gtcattaatg aagtcaaaga tggttatatt atcaaagtta acggtaagta ttacctctat 420 

cttaaagaag gtagcaaacg aacaaatgtt cgtacaaagg aacaaatcca aaagcaacgc 480 

gaagaatggt ctaagggtgg ttctaaggga gaatcaggaa agcatagtag tgcaaaaact 540 

caagctttat cagccagcgt acgtgaagcg aaagctagcg gtcgctatac tacagatgat 600 

ggttatgttt ttagtccaac agatgtcatt gafcgacatgg gagatgcctt tttagtaeca 660 

catggcgatc atttccatta tatcccgaaa gctgatttat ctccatcaga gttatcagcc 720 

gcacaggcat attggaaccg caagactggc cgatcaggta atagctcaaa accatcaaat 780 

tcaagttcat acattcatgc aagtgcacca agcggtaatg tatcaacagg aagacatgct 840 

aatgccccta tttctattcc aagagtcact cacgcaaatc actggtcaaa accagctggt 900 

aatcatgcta cggcaccaaa acaccatgca cctacaacca aaccaatcaa caaagattct 9 60 

gctttagaca aaatgttgaa acgtttatat gctcaaccgt tatatgctcg tcacgtagaa 102 0 

agtgatggtt tggtttatga cccagcacaa gttaatgctt tcacagcaat cggtgtttct 108 0 

attccgcacg gtaatcattt ccactttatt cactataagg atatgtctcc attagagtta 114 0 

gaagcaacaa ggatggtggc agagcataga ggacatcata ttgatgcatt agggaaaaaa 12 0 0 

gattctacag agaaaccaaa gcatatttct catgaaccta ataaggaacc tcacacagag 12 6 0 

gaagaacacc atgcagtaac accgaaagac caacgtaaag gcaaaccaaa tagccagatt 13 2 0 

gtctacagtg ctcaagaaat tgaagaggca aaaaagctgg taaatacaca acatctgatg 13 8 0 

gttacatttt tgatgccaaa gatattaaaa aagatacagg tacaggttat gtcattccac 144 0 

atatga 144 6 

SeqID 14 5 

atggctaaat ttggattttt atcggtacta gaagaagaat tggataagca tttacagtat 6 0 

gactttgcca tggattggga taagaaaaat catactgtag aagtcacttt tatattagaa 12 0 

gcacaaaaca gctctgctat tgaaaccgtg gacgaccagg gggagacgag tagcgaggac 18 0 

atcgtttttg aggactatgt actcttttac aaccctgtca agtctcgctt tgatgctgag 24 0 

gactatctgg tgactatccc ctatgagcct aagaagggct tgtcgcgtga gtttctggct 3 00 

tattttgcgg agactctcaa cgaggttgcc actgaagggt tgagcgacct catggacttc 360 

ctcaccgatg acagtatcga agagttcggt ctttcttggg atacggatgc ttttgaaaat 42 0 

ggcagagcag aactgaaaga gacagagtfct tacccttatc cgagatatta g 471 

SeqID 146 

atgaacacaa aacagcgttt ttcaatccgg aaatataagt taggtgccgt atctgtactt 60 

ttgggaaccc tatttttttt aggtggtatc acaaatgtag ctgctgattc tgtcataaat 120 

aagccatctg atattgcagt tgaacagcaa gtaaaagaca gtccaacgag catagcaaat 18 0 

gagacaccta ctaacaacac gtcatcagcc cfctgcgtcga cagctcaaga caatcttgtt 24 0 

acaaaggcta ataatagtcc aacagaaaca caaccagtag ctgagtctca ctctcaagcc 3 00 

accgagacat tttccccagt cgcaaatcaa ccggttgaaa gcactcaaga agtttctaaa 3 60 

actcctttaa ccaaacaaaa tttagcagtc aaatctacac cagctatttc taaagaaacc 420 

cctcaaaaca ttgatagtaa taaaattatc actgtcccca aagtatggaa cacaggctac 48 0 

aaaggagagg gaactgttgt agcaattatt gactcaggac tagatatcaa tcacgatgct 540 

ctccaattaa atgattcgac aaaagcaaaa taccaaaacg aacagcaaat gaatgctgct 600 

aaagcaaaag ctggtataaa ctatggaaaa tggtataaca acaaagtaat ctttggtcac 660 

aactatgttg atgtcaatac agagctaaaa gaggtgaaaa gcacttctca tggtatgcac 720 

gtaaccagta tcgcaacagc taatcctagc aagaaagata caaatgaatt aatctatggt 780 
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gttgctcctg 
ggaccagccc 
aatttaagtt 
gctttagaga 
acatttggga 
agtccatcaa 
aataaagtct 
gcttatgcag 
gttttcgtag 
gccttaattg 
ggtgctgtgg 
ttagatcctg 
ttagctaaaa 
aatgcaggtg 
cctgacttat 
atgafcgagtg 
caatacttat 
taccttctta 
ccacgccaac 
ctaactggtg 
tcctttgatg 
tatttaaata 
acatttacag 
atagatgttt 
gaaggctacg 
gttggattta 
gttgctaaca 
tcagaagatt 
actagtccta 
ttacaactag 
caagatgccg 
gtctatcgtg 
ggagataaaa 
acagaatgga 
gttttaactt 
actttggata 
ttcaaagctc 
tacttaaaaa 
ggtatccttg 
ctacctaaag 
aatctagtat 
aacgtcaaag 
gtcaaagatg 
ttgaaattgc 
aatctaatca 
gtcttattta 
ttaccaaaag 
aaagcaactt 
aacgtaacgc 
gtaaaagaag 
aatgccctag 
cacttaaaaa 
gtggaacaaa 
aacggtaaag 
aagggaaact 
gtccaattgg 
gccttagaaa 
ggtttacacc 
aacgccagtc 
ctttccaata 
accgaacaag 
ctcaatgata 
ttgatattca 
gtgactagcc 
agcactccta 



aagcacaagt 
tttatgttaa 
taggtggagc 
tggctagact 
gtggagcatc 
cagctcgtga 
tcaacattat 
atcctaaagt 
gaaaaggaaa 
aacgtggaga 
gagctattat 
aagcaagtgc 
acaactataa 
tcctatctga 
ctgctcctgg 
ggacaagtat 
tgaaagaaca 
tgagtactgc 
aaggagcagg 
gggaaaacaa 
ttactgttca 
ctgatcaagt 
ggaaaacgat 
cgaaatacca 
tacgttttac 
agggagaatt 
aagaaaaggg 
atactgcctt 
tccaattgaa 
atcaaaaagg 
ttgcagtgaa 
cagatgatgt 
attactacag 
aaggaaccac 
attactctga 
gacaagctcc 
gtcctgcagt 
aagataaaga 
ttgaagataa 
aggttaacga 
cagcaaaact 
tgtttagccc 
acaaaggtaa 
cttttggtac 
gtcccaaaag 
aagttaactt 
gagcaacagt 
attctcctac 
tgcctagtgg 
atcaggtaaa 
cagaacaaac 
ataattacct 
caagtattga 
aaacagatac 
atcaatttgt 
cgaaaaacct 
atcttgatat 
atatgataat 
aatttgctaa 
gacaagctac 
agcttgatgg 
atgactctct 
aaaaaggcat 
aaaccgataa 
aaataaccta 



aatgtttatg 
agctattgaa 
taatgggtct 
cgctggcgtt 
aaagccttct 
ggccatttct 
cggattagaa 
tagtgataag 
cgacaatgat 
tattactttt 
ctttaataac 
tattcctgct 
aattgtattt 
cttttcaagc 
aggctctatt 
ggcttctccc 
tccagaactt 
taaagcacac 
tattatcgat 
ctatggcagc 
taatatcaat 
taaagatggc 
acggattgaa 
tgacatgtta 
agaccctgtt 
ccaaaactta 
attttatttc 
aatgactaca 
agccttggga 
ccagcctcat 
aggtgttttc 
taatttacaa 
cggaaatact 
tactgatggt 
tgtccctggc 
tacactaaca 
agaacacggg 
tggtcattat 
caaagtattt 
tttctctcat 
cgaagatttg 
tgagcttaac 
tatcatcaaa 
ctatacgttt 
tgtcactgta 
actcaagaaa 
ccagttggtt 
tgactatggt 
gtatagcact 
cctaacaaaa 
tgatattatt 
agctaatctt 
taatgctatt 
ttctttactg 
taatgctagt 
tctacaaaaa 
tgctaaaaat 
taaagcaaac 
agaaaattat 
acaagctcaa 
ccgtgataga 
caataccaca 
gactaaagaa 
tcaaaaggta 
tacaatattg 
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agagtcttct 
gatgccgtta 
ttagttaatg 
tctgttgtta 
gctctttatc 
gtagcatcat 
aacaacagaa 
acctttgaag 
tataaggaca 
acaaaaaaag 
aaagctgggg 
atttttaccc 
aacaatatca 
tgggggttaa 
tacgccgcta 
catgtcgctg 
aaaaaaggtg 
cta.aacaa.ag 
gtcgcagcag 
gttacattag 
aaagttgcaa 
tttgtcacat 
ccagggcaaa 
aaaaaagtaa 
gatggtgggg 
gaagttttag 
caaccaaaac 
agttcagagc 
agctataagt 
ctagccattt 
ttacgtaatt 
aaaccactat 
gaaaatccaa 
attcctttag 
tctaagccac 
acagcaactt 
gaatctggta 
aatagcgtct 
atcaaacaag 
gtctactata 
atcaatatfcg 
agtaatgtcg 
aagcaacatc 
gatctattct 
actatttctg 
gcagccttac 
actaagacaa 
aaaaacatac 
ttagagaact 
ttgacgctga 
acccagcctg 
gaaaaggcac 
gctgctttga 
gcaaaagcta 
ccattaagcc 
cctaacgtca 
caattaaatg 
gttctgaaac 
aataacctta 
gttgaagagt 
gtttcttccg 
cctatcaatc 
agtgaggttg 
aagacaaaca 
ctatttagtc 



ctgatgaaaa 
aactcggtgc 
ccgatgaccg 
tagcagcagg 
ctgattatgg 
ataataatac 
atctcaacaa 
tagggaaaca 
aaactttaaa 
tcgtcaatgc 
aagctaatct 
aaaaagagtt 
aaaataaaca 
cagcagacgg 
tcaatgataa 
gtgctactgc 
acattgaaag 
atacaggcgc 
cagttcagac 
gaaatattaa 
aagatttaca 
tggctcctca 
cccaaacgat 
tgccaaacgg 
aagttcttag 
aaaaatccat 
aaacaaacga 
ctatctactc 
ctatagatgg 
cacctaatga 
tcaataattt 
gggtaagtgc 
aatctacatt 
aagatggaaa 
aacaaatggt 
atgacaaaga 
tctttagaga 
tacgtcaaca 
aaaagaatgg 
ctgttgaaga 
gcaataaaaa 
atattgattt 
acgggaaaga 
tatacgatga 
aaaaagatag 
tcgttgaatt 
atactgttgt 
ctgtaggaga 
tagatgattt 
ttaataaagc 
tgttttataa 
aaactttaat 
gagaaagtcg 
ttttagctga 
aatcaactta 
ctcaatcaga 
gtcatgaaac 
aaacatcatc 
tcaagaaagc 
tattaaacca 
cagagaatta 
cgccaaatca 
ctcagaagcg 
agcttcctaa 
tctctatgct 



aagaggaact 
tgacagcatt 
acttataaaa 
taacgacggt 
tttagttggt 
aacactggtt 
cggattagct 
atatgattat 
tggtaaaatc 
tattaatcac 
aacaatgagt 
tggagatgtt 
agccaaccct 
acaattaaaa 
tgaatatgat 
tctagttaaa 
aactgtcaaa 
ttacacctca 
aggattatac 
agataaaatt 
ctatacaacc 
acaacttggt 
tacaattgat 
ctatttccta 
tattccttat 
ttataagctt 
agttcctggt 
aacagacggt 
aaaatggatc 
tgaccaaaat 
aagagccaaa 
tccccaagca 
tttatatgac 
atacaaatac 
gtttgatatc 
tagacgtatc 
acaagtttgt 
aggagaagac 
tagctttatt 
ttatgcaggc 
tggtttagfca 
ctcttactct 
cctcaattta 
ggaacgagca 
ccttaaagac 
tgacaagctt 
tgatctacca 
ctatcgttta 
acttgtatcc 
tcctctgatt 
tgctggaact 
taaaaataga 
ccaagctctt 
aacagaaatc 
tatcaatcaa 
agtagacaaa 
tgattactct 
taaatatcag 
agaattgctg 
aataaaagca 
tagtcaatca 
gccccaggcg 
tgtcttaggg 
aacaggcgaa 
aggtctggca 



840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
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acaatcaaac taaagtctat caaaagagaa taa 



4713 



aaaaaaaatt aa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



SeqlD 147 

atgaataata acgaaaaaaa agtaaaatac tttttaagaa aaacagctta tggtttggcc 
tcaatgtcag cagcgtttgc tgtatgtagt ggtattgtac acgcggatac tagttcagga 
atatcggctt caattcctca taagaaacaa gttaatttag gggcggttac tctgaagaat 
ttgatttcta aatatcgtgg taatgacaaa gctattgcta tacttttaag tagagtaaat 
gattttaata gagcatcaca ggatacactt ccacaattaa ttaatagtac tgaagcagaa 
attagaaata ttttatatca aggacaaatt ggtaagcaaa ataaaccaag tgtaactaca 
catgctaaag ttagtgatca agaactaggt aagcagtcaa gacgttctca agatatcatt 
aagtcattag gtttcctttc atcagaccaa aaagatattt tagttaaatc tattagctct 
tcaaaagatt cgcaacttat tcttaaattt gtaactcaag ccacgcaact gaataatgct 
gaatcaacaa aagctaagca aatggctcaa aatgacgtgg ccttaataaa aaatataagc 
cccgaagtct tagaagaata taaagaaaaa attcaaagag ctagcactaa gagtcaagtt t>60 
gatgagtttg tagcagaagc taaaaaagtt gttaattcca ataaagaaac gttggtaaat 
caggccaatg gtaaaaagca agaaattgct aagttagaaa atttatctaa cgatgaaatg 
ttgagatata atactgcaat tgataatgta gtgaaacagt ataatgaagg taagctcaat 
attactgctg caatgaatgc tttaaatagt attaagcaag cagcacagga agttgcccag 
aaaaacttac aaaagcagta tgctaaaaaa attgaaagaa taagttcaaa aggattagcg 
ttatctaaaa aggctaaaga aatttatgaa aagcataaaa gtattttgcc tacacctgga 
tattatgcag actctgtggg aacttatttg aataggttta gagataaaca aactttcgga 
aataggagtg tttggactgg tcaaagtgga ct'tgatgaag caaaaaaaat gcttgatgaa 
gtcaaaaagc ttttaaaaga acttcaagac cttaccagag gtactaaaga agataaaaaa 
ccagacgtta agccagaagc caaaccagag gccaaaccaa atattcaagt acctaaacaa 12 60 
gcacctacag aagctgcaaa accagctttg tcaccagaag ccttgacaag attgactaca 132 0 
tggtataatc aagctaaaga tctgcttaaa gatgatcaag taaaggacaa atacgtagat 13 8 0 
atacttgcag ttcaaaaagc tgttgaccaa gcttatgatc atgtggaaga gggaaaattt 144 0 
attaccactg atcaagcaaa tcaattagct aacaagctac gtgatgcttt acaaagttta 150 0 
gaattaaaag ataaaaaagt agccaaacca gaagccaaac cagaggccaa accagaagct 1560 
aagccagaag ctaagccaga agctaagcca gaagctaagc cagaggccaa accagaagct 162 0 
aagccagacg ttaagccaga agctaaacca gacgttaaac cagaggctaa gccagaagct 168 0 
aaaccagagg ctaagtcaga agctaaacca gaggctaagc tagaagctaa accagaggcc 174 0 
aaaccagcaa ccaaaaaatc ggttaatact agcggaaact tggcggctaa aaaagctatt 18 00 
gaaaacaaaa agtatagtaa aaaattacca tcaacgggtg aagccgcaag tccactctta 18 60 
gcaattgtat cactaattgt tatgttaagt gcaggtctta ttacgatagt tttaaagcat 1920 



720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 



SeqlD 14 8 

atggcaaaag atattaaatt ttcagcagat gcccgctcag caatggtgcg tggtgttgat 60 

attttagctg atacagtcaa agtaacatta ggtcctaaag gccgtaatgt tgttcttgaa 120 

aaagcatttg gttctccttt aattacaaat gatggtgtga caattgctaa agaaattgag 180 

ctagaagatc actttgaaaa tatgggagct aaacttgtgt cagaagtggc ttcaaaaact 240 

aatgatattg caggggatgg cactacaact gctactgttt tgacccaagc tattgtacgg 300 

gaaggtctta aaaatgtaac tgcaggggca aatccgattg gcattcgtcg tggtattgaa 3 60 

acagctgttt cagcagcagt tgaagagcta aaagagattg cacaaccagt ttcaggcaaa 420 

gaagctattg ctcaagttgc ggctgtgtct tcacgttctg aaaaagttgg ggaatacatt 480 

tctgaagcta tggggcgcgt gggtaatgat ggtgttatca ctattgaaga atcgcgaggt 540 

atggaaacag agcttgaagt tgtggaagga atgcagtttg accgtgggta cttgtcacag 60 0 

tatatggtaa ctgataacga gaaaatggtc tctgaacttg agaatccgta tatccttatt 66 0 

acagataaga aaatttcaaa tatccaagaa attttaccat tattagaaga ggttcttaaa 72 0 

acaaatcgtc cgttgctaat catcgctgat gatgttgatg gagaagctct cccaacgctt 78 0 

gttcttaaca aaattcgtgg aactttcaat gtcgtagctg ttaaagcgcc tggatttggt 840 

gatcgtcgta aagccatgct ggaagatatt gctatcctaa caggaggaac tgtcgttact 90 0 

gaagaccttg gtttagactt aaaagatgct actatgcaag ttttaggaca gtctgctaaa 960 

gtaacagtag ataaagattc tactgttatt gtcgaaggtg ccggtgactc atcagcaatt 102 0 

gctaatcgcg tagctatcat taagtcacag atggaggcta caacttctga ttttgatcgt 108 0 

gaaaaattac aagaacgact tgctaagtta gccggtggtg tagcagtaat taaagttggt 114 0 

gcagcgactg aaacagaatt aaaagagatg aaacttcgca tcgaagatgc gttaaatgca 1200 

acgcgtgctg cagttgaaga aggtattgtt tcaggtggag gtacggctct tgtgaacgtt 126 0 

attgaaaaag tagcggcact gaaacttaat ggtgatgagg agactggacg taatattgtt 132 0 

cttcgtgctc tcgaagagcc tgttcgtcaa attgcttaca atgctggata tgaaggttca 13 8 0 

gttattattg aacgtttaaa acagtctgaa attggtacag gatttaatgc ggccaatgga 144 0 

gaatgggtag atatggttac cacaggtatc attgaccctg tcaaagtaac acgttctgca 15 0 0 

cttcaaaatg cggcatctgt agcaagtctt atcttgacta cagaagcagt agtagcaaat 156 0 

aaacctgaac cagaagctcc tacagctcct gcaatggatc catctatgat gggtggcttc 162 0 
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SeqID 149 ^ 

gtggtagaaa acctagaaaa accaattgga gtgagctata aaaatagccc atcaatgagt 60 

aaacgaacgg ctatcagaat gaagaaatct agtcgatttt caattttact atatagtgtt 120 

ttgtcaactc tcttagcaat tgcaaatcct ttattgactt attttgcaaa tggtttgcag 180 

actcagaatc tctatacagg tttgatgatg accaaaggac agatccctta tagtgatgtt 240 

tttgctacag gaggattttt atactatgta acgattgctc taagttacct tttagggtct 3 00 

agtatctggt tacttattgt acagtttatt gcttactatg tatctggaat ttatttttat 3 60 

aaattagttt attatgtggc acaaagtgaa attgtctcga taggcatgac gttgattttc 420 

tatataatga atattgtctt aggattcggfc ggtatgtacc caatacagtg ggcattacct 480 

tttatgctca tttcgctatg gtttttaatt aaattttgtg tcgataatat cgttgatgaa 540 

gcatttatat tttatggcat tttagcagca ttctcactat ttatagatcc ccaaacactg 60 0 

attttttggc tttgctcttt tgttctctta acagcaacca atattaagca aaagcagtca 66 0 

ctacgaggat tttatcagtt tttatgcgtt gfcttttggta tgattttaat tgcttatact 72 0 

gttggttact ttatgtttaa tctacagatt atctcttctt atattgacaa agctattttt 780 

taccccttta cgtatttcgc taggacaaac cactcattcc ttcttagttt agcaattcaa 840 

attgttgtcc ttctagggtc aggttgtctc tttggactat gggattttat tcaaaatcgt 900 

aaaaaagcat cttatcaaat cggcttgaac tttattgcct gtatttttat tatttacgct 96 0 

ataatggcca ttttttcaag agattttaac ctttaccact ttttacctgc tttaccattt 102 0 

ggtcttttgc taacgagtaa taaaattact attctttacc aaaaagtgat tgacagacgt 108 0 

tcacatagac gtcagtattt ctctggtaaa tctcttatcg tagacctttt tgtaaagaaa 114 0 

acatattatt tgcccctctt actagtttcc ttatcgatag ggttactagt ttataatact 12 00 

tatcaaaatg ttactcttag taaagaaagg cgtgacattt cacattattt gactacaaaa 12 60 

attgatagag atggtaagat ttatgtttgg gacaaagttg ctagcattta tagccagact 13 2 0 

cggttaaaat ctgcttctca atttgttttg ccacatatta atacagcgca aaaaaataat 13 8 0 

gagaaaattt tgaaggacga gctgcttcag catggggcta aatactttat cttaaataaa 144 0 

aatgagaagt taccaaatga actgaaatct gatattaaga agcactacca agaagttcct 15 0 0 

ctatcaaata tcactcattt tgttctatat cgtttcaaat ag 1542 

SeqID 150 

atggccaagc caacgatatc accgggaatg caacagtatc tggatataaa agagaattat 60 

ccagatgctt ttttgctttt tagaatgggt gatttttatg aattatttta tgatgatgcg 120 

gtaaaagcag cacaaatcct ggaaattagc ttgactagtc gaaataagaa cgcagaaaag 180 

ccaatcccaa tggcaggagt tccctatcac tcagctcaac agtatattga cgttttagtt 240 

gaattaggtt acaaagtagc cattgctgag cagatggaag atcctaaaaa agctgtggga 3 00 

gtggtcaagc gtgaggtagt gcaagtfcgtt accccaggaa cggttgtgga gtcaacgaag 3 60 

ccggatagtg ctaataattt cttagtagcg attgattcgc aagatcaaca aacatttggt 420 

ctagcatata tggatgtctc aactggagag tttcaggcaa cccttttaac agattttgag 4 80 

tccgtccgta gtgaaatact aaatttaaaa gctcgtgaga ttgtagtagg atatcaattg 540 

acggacgaaa aaaatcacct actgacgaag cagatgaact tgcttttatc atacgaagac 600 

gaacgactta atgatattca tttgattgat gagcagttaa ctgatttgga aatatctgct 660 

gcggaaaaac ttttacaata tgtgcataga acacaaaagc gtgaacttag tcatttacag 720 

aaagtagttc attatgaaat aaaggactat ttacaaatgt catatgcaac gaaaaatagt 780 

ctagatttac tggagaatgc tagaacaagc aagaagcatg gaagtcttta ctggttgtta 840 

gatgagacta aaacagcgat gggaactcga atgctgagaa cttggattga caggcctttg 900 

gtaagtatga atcgaatcaa ggaacgtcaa gatattattc aagtgtttct tgattatttt 960 

tttgagagaa acgatctcac agaaagttta aagggtgtat atgatattga acgcctagca 1020 

agtcgagtat ctttcggaaa agccaaccct aaagatctat tgcaactcgg acagacctta 1080 

tcacaaattc ctcggattaa aatgatttta cagtccttca atcaacctga gcttgacatc 114 0 

attgtcaaca aaatfcgacac tatgcctgaa ttagaaagtt taattaatac ggcgatagcc 1200 

ccagaagcac aggctactat cactgaggga aacattatca agtctggatt tgataagcaa 126 0 

ttggataatt atcgaacagt gatgcgtgaa ggtacaggtt ggattgctga tattgaagct 132 0 

aaggaaagag cagcaagtgg tatcggtact cttaaaattg attataataa aaaagacggt 13 8 0 

tattacttcc atgttaccaa ttccaattta tcactagtac cggagcattt tttccgtaaa 144 0 

gcgacattaa aaaattctga acgctatgga acagcagaac tagccaaaat tgaaggtgaa 150 0 

atgctcgaag ctcgcgagca atcttcaaat ttagaatatg atatttttat gcgtgttcgt 156 0 

gcccaagtag aatcttatat taaacgtctt caagagttag caaagacgat tgcaaccgtt 162 0 

gatgttctac agagtttggc agtagttgca gaaaattatc actatgttcg tcccaaattt 1680 

aatgatcaac atcagattaa gattaagaat gggcgtcatg caactgttga aaaagtgatg 174 0 

ggagtgcaag aatatattcc caatagcatc tattttgata gtcagacaga tatccagttg 18 0 0 

attacaggac caaatatgag tggtaagtcg acctatatgc gccagttagc tttgacagtt 18 6 0 

attatggcac aaatgggagg ttttgtatcg gcagacgaag ttgatttgcc tgtatttgat 192 0 

gcaatattta ctaggattgg tgctgctgac gacttaattt ctgggcaatc aacctttatg 19 8 0 

gtagaaatga tggaagcgaa tcaagctgta aaacgagcca gtgataaatc tttgattctt 2 04 0 
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tttgatgaat taggtcgagg gacagccact tatgatggta tggcattagc tcaatcgatt 2100 

atagaatata ttcatgaccg tgttagggca aaaacaatgt ttgcgactca ttaccatgag 2160 

ttgacagatt tatctgaaca gttgacaagg cttgtcaatg tacacgtggc tactttagag 222 0 

agagatggag aagttacctt cttacataaa attgaatctg gacctgcgga taagtcttat 228 0 

gggatacacg tcgcaaaaat agctggttta ccaattgact tattggatag ggcaactgat 234 0 

attttatcac agttggaagc tgatgcagta cagttgatcg tatcgccctc ccaagaagct 24 0 0 

gttactgctg acttaaatga ggaactagat tctgagaagc aacaaggaca attatcgctt 24 60 

tttgaagaac cttcaaatgc aggtagggtt attgaggagt tagaagcgat agatataatg 252 0 

aatctaactc caatgcaagc tatgaatgct atatttgact taaagaaatt attataa 2577 

SegID 151 

atgttgaaac tggatttaaa aacaaagata aaagaagcta ttttaattgc ctttggtgtg 6 0 

gctctctata ctttcggctt tgtcaaattt aatatggcta atcatttggc tgaaggaggt 12 0 

atttccggtg tcactttgat tatccatgca cttttcggag ttaatccagc actttcatca 18 0 

cttctcctta atatcccgtt atttatatta ggggctagga ttttagggaa aaaatcctta 24 0 

ttattaacta tatatggtac agtgttaatg tcttttttca tgtggttttg gcaacaaatc 3 00 

cctgtaacag tccctttgaa aaacgatatg atgttagttg cagtagctgc tggaatttta 3 60 

gcaggaactg gtagtggttt agtatttaga tatggtgcaa ctacaggtgg tgcagatatt 42 0 

attggtcgta ttgtagagga aaaatctgga ataaaattag ggcaaaccfct attgtttatt 4 80 

gatgctatag ttctaacgtc ttcacttgtt tatattaatt tacaacagat gctatatacg 540 

ttagtagcta gctttgtatt tagtcaagtg ttaacaaatg ttgaaaatgg gggctacact 6 00 

gttcgtggaa tgatcattat taccaaagaa tcagaatctg cagcagcaac cattttgcat 660 

gaaattaata ggggtgtaac ttttttacgg ggccaaggtg cctactcagg tagagagcat 720 

gatgtcttgt acgtagctct taacccaagt gaagtgcgtg atgtgaaaga aatcatggct 780 

gatttagacc cagatgcttt tatttctgtt ataaatgtag atgaagttat tagctcagat 840 

tttaaaattc gacgaagaaa ttacgataaa taa 8 73 

SeqID 152 

atgaaacgta gtatgtatgc tgggcgtgtt cgtagtgagc acataggaac atcgattact 60 

ttaaaaggat gggttggtcg tcgtcgtgac cttggaggcc ttatttttat tgatcttcgc 120 

gatcgcgagg gaatcatgca attagttatt aatcctgaag aagtagcagc ctcagttatg 180 

gcaaccgcag aaagtctccg tagtgaattt gttatagagg ttagcggtgt agtgacagca 240 

cgagaacaag ctaatgataa tttaccaaca ggagaagttg aattaaaagt gcaagaatta 3 00 

tcggttctta atacctcgaa aacaacacca tttgaaatta aagatggtat tgaggcaaat 3 60 

gatgatacac gcatgcgtta ccgttacttg gatcttcgtc gtccggaaat gcttgaaaat 420 

ttcaaattac gtgcgaaagt aacacactca atccgtaatt accttgacaa cttagagttt 4 80 

attgatgtcg aaactccgat gctaacaaaa tcaactccag agggggcacg agactatttg 540 

gttccatcac gtgtcaacca aggacatttt tatgcccttc cacaaagccc tcaaattaca 600 

aaacagctat tgatgaatgc aggatttgat cgttattatc aaattgtcaa atgttttcgt 660 

gatgaggatt tacgtggaga tcgtcaacct gagtttacac aagttgactt agagacatca 720 

ttccttagtg atcaagaaafc tcaagacatt gttgagggta tgattgctaa agtaatgaaa 78 0 

gacactaaag gtttagaggt ttctttacca ttcccaagga tggcatatga tgatgcaatg 840 

aataactacg ggtctgataa gcctgataca aggtttgata tgcttttaca agacttgaca 900 

gaaattgtaa aagaagttga cttcaaggtg ttctcagaag cttcagtagt taaagcgatt 960 

gttgtcaaag acaaggcgga caagtactct cgaaaaaata ttgataaatt aacagaaatt 102 0 

gcaaaacaat acggtgccaa aggtttagca tggctaaaat atgtagataa tactatttca 1080 

ggtccggttg ctaagttctt gacagctatt gaaggtaggt tgacagaagc tttgcagcta 1140 

gagaataatg acttaattct atttgtagca gatagtcttg aagtagctaa cgagacttta 1200 

ggagcgcttc gtacacgcat tgcaaaagaa cttgaactca ttgattattc taaatttaat 1260 

ttcctatggg ttgttgattg gccaatgttt gagtggtcag aagaagaggg acgatatatg 132 0 

tcggctcatc atccattcac cttgccaaca gcggagacag cgcatgagtt ggaaggggat 13 8 0 

ttggctaaag tacgtgcagt tgcctatgat attgttttga atggatatga gttaggcggt 144 0 

ggtagcctac gtattaacca aaaagataca caagagcgta tgttcaaggc tttaggattt 150 0 

agcgctgaat cagctcagga acagtttggt ttcttacttg aagctatgga ttacggcttc 156 0 

ccaccacatg gtggattggc aataggcctt gaccgctttg ttatgttgtt agcaggcaag 162 0 

gacaatattc gagaagttat tgcattccct aaaaacaata aggctagtga tccaatgact 168 0 

caagctccta gtctcgtttc agaacaacaa ttagaagaat tatcacttac tgtagaaagt 174 0 

tatgaaaact ag 1752 

SeqID 153 

aaagaaaaag gaaaactaat aaagaaaaaa attctagaaa ataatactga tattattcca 60 

aaaattatag aaaaaagtcc tcaaaatctt attcttacaa gtaattacaa cagagttaat 12 0 

attgataaaa taaaaaatat aaaaaatttt gataaaggat tcgaactagg ctttccactt 18 0 

tttgaaaaag gagaaattct gagaaaagaa ggcgaaataa ctagtgcaat tgaattattt 24 0 

gataaagcac gtgaactagg atattttgta cccgccttat acaattctta tgcgatggct 300 
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360 
420 
456 



360 
420 
480 



tttaggaaaa taaagaatta tgatgatgaa attctaatac tacaagaagg aattgaaaga 
ttcaaaaaaa gcaccctgtc atcaaatatt aacccaaaga caatcgatag atggagtact 
cgtatttcta gagctaaaga tttaaaatgt aaataa 

SeqID 154 , . 1 l u , _ n 

ttgaatgtaa aaaaacatca tttagcttat ggagctatca ctttagtagc ccttttttca 60 

tgtattttgg ctgtaatggt catctttaaa agttcacaag ttactactga atctttgtca 120 

aaagcagata aagttcgcgt agccaaaaaa tcaaaaatga ctaaggcgac atctaaatca 18 0 

aaagtagaag atgtaaaaca ggctccaaaa ccttctcagg catctaatga agccccaaaa 240 

tcaagttctc aatctacaga agctaattct cagcaacaag ttactgcgag tgaagaggca 3 00 
gctgtagaac aagcagttgt aacagaaaac acccctgcta ccagtcaggc acaacaagct 
tatgctgtta ctgagacaac ttatagacct gctcaacacc agacgagtgg ccaagtattg 
agtaatggaa atactgcagg ggctattggc tcagcagctg cagcacaaat ggctgctgca 

acaggagtcc ctcagtctac ttgggaacat attattgccc gtgaatcaaa tggtaatcct 540 

aatgttgcta atgcctcagg agcttcagga cttttccaaa cgatgccagg ttggggttca 600 

acagctacag ttcaggatca agttaattca gctattaaag cttatcgtgc tcaaggttta 66 0 

tcagcttggg gttactag : 8 

SeqID 155 

atgtcaaatt gggatactaa gtttttgaaa aaaggtttta cttttgatga tgttctgctt 6 0 

attcctgctg aaagccatgt tttaccaaat gaagttgata tgaagacaaa gttggctgat 12 0 

aatttgacat taaacattcc aatcattaca gctgcaatgg atacagttac agatagtaag 180 

atggctattg ctattgcacg tgctggtggg cttggtatca ttcataaaaa tatgtcaata 24 0 

gttgatcaag cagaagaagt tcgtaaagtt aaacgctcag aaaatggtgt tattattgac 3 00 

ccattcttct tgacaccaga taatactgtc tctgaagctg aggaattgat gcaaaactat 360 

cgcattagtg gagttcctat tgtagagaca ctcgagaatc gfcaaattagt tggaattatt 42 0 

actaaccgtg atafcgcgttt catttctgat tataaacaac taatctcaga gcacatgaca 48 0 

agtcaaaatc ttgtaacagc tccfcattggt acagatcttg aaactgcaga gcgtattctt 54 0 

catgagcatc gtattgaaaa gctaccttta gtagatgatg agggacgtct ttcaggactt 60 0 

attactatta aggatafccga aaaagttatt gaatttccta aggcagcaaa agatgaattt 660 

ggacgtctct tagttgcagg tgcagtaggt gtaacctctg atacttttga acgtgcagaa 72 0 

gctttgtttg aagcaggtgc agatgctatt gtcattgata cagcacatgg acattcagct 78 0 

ggtgtgctcc gtaaaattgc tgaaattcgt gctcacttcc ctaatcgcac attgattgcg 84 0 

ggaaatattg ctacagcaga aggagcgcgt gcactttatg atgcaggtgt ggatgtcgtt 90 0 

aaagttggta ttggtccagg ttcaatttgt acgactcgtg tcgttgctgg ggtaggtgtt 960 

cctcagatta cagctattta cgatgcagcg gctgttgcgc gtgagtacgg taagactatc 102 0 

attgctgatg gtggtattaa atattcaggt gatattgtaa aagcacttgc agcaggtgga 1080 

aatgcagtta tgctaggttc aatgtttgca ggaacagatg aagcgccagg agaaacagaa 1140 

attttccaag ggcgtaagtt caagacatac cgtggtatgg gctcaattgc agcaatgaaa 12 00 

aaaggttcaa gcgatcgtta tttccaaggt tctgtaaacg aggctaacaa attggttcca 12 60 

gaaggtattg aaggtcgtgt agcatacaaa ggttcagtag ccgatattgt tttccaaatg 13 20 

cttggtggca ttcgctctgg tatgggatat gtaggtgcag cgaacataaa agaacttcat 13 8 0 

gataatgctc aatttgttga aatgtcagga gctggcttga aagaaagcca tccccatgat 1440 

gttcaaatta caaatgaggc accaaattat tcagtacact aa 14 82 

SeqID 156 

atgacaatta accactataa gcttcgtata ccttactata caataagctt tctacttccc 60 
ttcataatca tagtctgtat cctctttact aagaatattt attggggaag tccaacaacc 120 
atattagcaa gcgatggctt tcatcaatat gttatcttta atcaagcatt aagaaatatt 180 
cttcatggtt caaacagctt attctatacc tttacaagcg gtttaggttt aaatttttat 24 0 
gctctgagta gttactattt gggaagtttt ttatccccaa tagtttattt ctttaaccta 3 00 
aaaaatatgc ctgatgctat ctacctttta acaatatgta aaatagggtt gataggtttg 3 60 
tcaatgtttg tcactctttg taaacgtcat tgtaaagtta atcgtgtctt acttttagta 42 0 
atttctactt gctatagctt aatgagtttc tcaattagcc aaattgaaat taatatgtgg 48 0 
ctagatgtat ttattctaat tccccttgtt gttttaggag ttgatcagct actatgggaa 54 0 
agaaaaccta tactttattt cctctctctg acagccttat tcatccaaaa ctattatttt 60 0 
ggttttatga ctgccatctt tactagcctt tacttcatcg tacaaataac ccgtaatact 660 
gatagtaagg ttgcatttaa acaatttctt cattttacct ttttatctct tttagctggt 72 0 
atgactagta gcattatgat tttgccaact tattttgatt taactactca tggtgaaaag 78 0 
ttgacaaagg tatcgaagat gtttacagaa aattcttggt acatggattt gtttgctaaa 84 0 
aatatgattg gagcctatga tacaacaaaa tttggctcta ttcctatgat ttatgtaggt 90 0 
ttacttcctt tactcctttc tttactttac tttacaataa aggaagttcc tcgaaggact 960 

agattagcat atggattttt aataatattt gttattgcta gtttttatat aacacctcta 102 0 

gatttatttt ggcaagggat gcatgctcca aatatgtttc tccatcgcta ctcttgggta 108 0 

ctttcagtac ttatctgtct gctagctgct gaatgccttg agtatttaga caacatttct 114 0 
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tatccctttc ttattggttt tgttgtatta gagatgacat taaacacctt 
aataqtctta acgatgaatg gatatttcca tctaggcaag gatatgctaa 



1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 



aatcaagtat atcaagataa accatttact gtgaatactt tggataatca gacaactctt 18b0 

ttaaatcaat taagtggttt gaaagagacg tattttgagc atcttattcc aaatagcgtt 1920 

tctqggcaaa ccactcttaa taaacaagta tttgttaaaa agaataaaca agggaatact 1980 

gaaattacct ataacataac tatccctaaa aacagtcagt tatatgttag tatgccgttt 2040 

ataaatttta ataatgagga aaataaaatt gttcaaatct ccgttaataa tggaccattt 2100 

gtacctaata ctcttgataa tgcttattca ttttttaata ttggttcctt tgctgaaaac 2160 

agtcgtatca aagtaaaatt tcaatttcca cacaatgatc aagtttcttt cccaattcct 2220 

cacttttatg ggctcaagtt agaagcttat caaaaagcta tgactgttat taataaacga 2280 

aaagtaaaag taagaactga tcacaataaa gttattgcta attatacaag tcctaataga 2340 

agttctcttt tcttcactat tccctatgat aggggttgga aagcttatca aaataataaa 24 0 0 

gaaataaaaa ttttcaaagc tcagaaaggt tttatgaaaa taaatattcc taaaggaaaa 2460 

ggcaaagtaa cattaatttt tatcccttac ggctttaaat ttggggtagg cctatctatt 2 52 0 

actggaatag tattatttac tgtctattat tttaaatttg gaaaaaataa aataggatga 258 C 

SeqlD 157 

tggtttatga agcactcggt tgggaagcgc cacaatttgg tcaca 

SeqlD 158 _ 

atgaaagatc acctaaggtg cttcctccat taccgatcca aaaacgttgg gaaaaatgga 6C 

acgagattga agggaggaaa gcctaatgac 9C 

SeqlD 159 

gacaaacagc ttttccaatg cgtgcagggc ttccaaataa ggaacctcaa tggcaagaag 6C 

catgggatca agctgatatt tataaaaaac gtcaagcat 95 

SeqlD 160 

aacgaggaag ccatagatgg ggtgtccaag cttgggaaaa aaatcaccat tctaggaaat 6( 

aattcatacc tttcttcc 75 

SeqlD 161 

atcaaacaat ggctattggt gatgaagaaa atgaccgtgc tatgcttgaa gttgttggaa 6( 

atccfcgttgt tatgcaaaac ggcaatcctg aac 9 - 

SeqlD 162 

aggcgcattt caaacaatct aggagtttta aaggaagcga ccgttataca cgccgta 5' 
SeqlD 163 

gagctctgcc taaagctgag caacgtcaag ccaatctata tgccttggca cttcgagcaa 6 

atcagtttga aaaaacaggt ttcaagggtt tatcacgctt tattcgaa 10 

SeqlD 164 

agaagctgct tcagtaggtg ctatccaaga tttaacagat tcagatatca ttttctcaaa 6 

ccaccgtgga cacggacaaa ctattgcaaa aggtat 9 

SeqlD 165 

agatggtfctg gttttagata tgacgatggc agaaaattta gctttacaaa cttatta 5 
SeqlD 166 

cccttaaaaa gtttgtcgat gacatcaaga atgaagctat tgaggttatc acaaagaaaa 6 



SeqlD 167 

aggcacttgc gaaagagaca aaagcaaaat catttaatga atttatgctt tctatgtgcc 
gtgaaaaaat tgaaaaagga caatttaatt 



gctatg 
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SeqID 168 

aagttattta ttcagcctat tcaggaaaat atgaagcagc caatcgagat tatatcaaat 6 0 

tactatgcaa agcttgggtt aatgaaaatc 9 0 

SeqID 169 

tatcaaacgt attcgaaaaa ggcaagaaaa tttggttccg tttggcttaa acctaag 57 
SeqID 170 

acgctgttat tccagatact tctcctgctt tctcatttga aaaagaacat ctttctggag 60 

tggaatatgc acgctactta caat 84 

SeqID 171 

aagaagtttt acgaaataat attagcaacc aagtaccaca catcagtgtt caaatggagt 60 

ttaaaactca agaacaagtt gacgaatacc aaaaaaatct cggaagcatc atccgggaaa 12 0 

ttggagatac acttggaaca gcaactgaat tcaatgccaa aagtaacatt agcacttata 18 0 

ctcttggtgg acaaatccaa cgcattattg 210 

SeqID 172 

ttaaagcaaa tgttccgtcg cgtattgcat ttgctgtttc aagtggtact gatagccgta 60 

cgatccttga tgaaaatggt gctgaaaagc tcttgggacg gggtgacatg ctctttaagc 120 

ctattgatga gaatcatcca gtacgactac aaggttcctt tatttcagat gatgatgttg 180 

aaaggatcgt tggttttatc aaagaccaag ccgaggctga ctatgatgat gcctttgatc 24 0 

ctggagaagt atctgaaaca gataacggct ctggtggtgg cggcggagta cctgaaagtg 3 00 

atcctcttfct tgaagaagcc aagggactcg ttt 33 3 

SeqID 173 

ggccggaaaa ttaattttag tcgtccatca cgatttgag 3 9 
SeqID 174 

agagttaagt ataaaaaatg ggaaccttta tcaatatccc gttcctgcta tgaaaaatct 6 0 

tcgtcaacat caagcagaat t 81 

SeqID 175 

acttggtcaa gatgttgtat tcccaggtgt tactcgtggt gcaaaattag aagaagcaat 6 0 

caatgctttg gaagatggac aagttctttt ggt 93 

SeqID 176 

aaggagaatt tcgttcaaag gttgaggagt ctattgaaag caatcagcaa aaatataatc 6 0 

atttatagtc tagtttctct accattagct agacacttg 99 

SeqID 177 

agtataatca atattaaact agaaatatat gacatatgtc ctaaaactat ctcaaatagg 60 
gagtcaatac atggatatag aaaactactg ccaagactac tgtttggaga ctatctttcc 12 0 

aaaacattat ttcaaggatt tacaagtaat caccctaact gccggccaat ccgtctg 177 

SeqID 17 8 

ctttctttca gaagaagtcg gcgaattggc tcaagccatt cgcaaata 48 
SeqID 179 

ttcattttca aggataggtt taaaatgaga cacttcacgg ac 42 
SeqID 18 0 

gggttgattt gtgcttgtaa agcctttatc tctgccatac tggctaactt attttgttcc 60 

tctgttatcc ccattgccag ttgtcctgaa aatatttgcg ctaaaccaag gactaggttt 120 

tcctccacct cagacattgt cfctatctcct gcaaagtaca tttttaaggc acccacagtt 18 0 

ttatcattta tttttagagg aactacaata gcagaattta ac 222 

SeqID 181 

tcctcctctt tggacaccca ttacaaacat agggagcttt tcttagtagc ggacagtcta 60 

ggcagtcttt ggttgattct cttagttgct tatttcgtt 99 

SeqID 182 

cacctggtgc ccactcaata tcataaacag aatctacctc ttgaatataa gaggctaaac 60 

gctctagtcc ataag 75 
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SeqID 183 

gcacacctac acgaagggta tcataagtat ccactaaaaa gacgcaatct ttatgagttt 6 0 

ctgcgtaagc tt 72 

SeqID 184 

ccattgaaag agctccgcga acattatata attcctcgga aaattcgtct tcaaaacttt 60 

ccccataga 69 

SeqID 185 

aagtatacta ttgcaggctc tcgtgtcacc cagcgtegac catcattttc tgttactaaa 60 

gta 63 

SeqID 18 6 

gcagcttttt catcaccagc agctaaaaac agagcattca ttacaacacg atcttca 57 
SeqID 187 

tccatggtta aacaaatatc gttctctgac catttgatat cacgttttga aggcttatgc 60 

ttgaggcgat tttctgtgcg attccgtttg agacgctcac ttagctctgt ttctaattct 120 

acaaataata cctcttggtt g 141 

SeqID 18 8 

taccacgttc tttctcgata tcatttgaat ccattgcacg ttcttcaagc tctttacgct 60 

catcaagagt atgtgattgt tttaataatt catcaacgag agttgtttta ccgtggtcaa 12 0 

cgtgggcaa 12 9 

SeqID 18 9 

gcacagcctt cttcaactag gatatcccct aaaattgatc gttttacacc taag 54 
SeqID 190 

tggaatagaa ctatcaatta ttttttgaat aacctgcaat tggtgactaa tatcaagtfcg 60 

cccacgagaa aaattttcca gtcccaaaac aagtttctga gcttcatcaa aaac 114 

SeqID 191 

gtccaagaag gccataaaat tttccctctt caaaga 3 6 
SeqID 192 

caacaaagca ataacaccgc caattatagt cgcccctata aaccttggca taaaaagaaa 60 

aatggattga gctgcagatg ctttagc 87 

SeqID 193 

ttctttgatg ataccaaagt taacattcat tggttggaaa tgtttgctgt ctgtatgtgt 60 

aatataatgt ggcagtgcac caatagcagt agtttgtggg aaaactactt cactctcccc 120 

attaaagcga cgggcagcat tgattcccgc tacaagacca cttgctgccg attctacata 180 

accttcaaca ccggtcattt gtccagcaaa gaaaagattt gggttcttac gtgtagcaaa 240 

ggtttggtc 249 

SeqID 194 

gtgtattcgt tacgcatagc gacattggaa ctagaacgaa acttgataga caagcgacgt 60 

gggaaaccaa ccgattctag gtcaatagag tttttagagg ttagtttagc aatattatct 120 

tgcaagaata agttaatacc tgacatcatg ctagagaaga catttccctt agtttcttct 180 

gaagcgaagt cagaagacct aaagtttaaa tcagccatat ctctaaattt ggaaaattgt 240 

tcfc 243 

SeqID 195 

ggccatagca atggcattaa agagggaaat ggaggtatct tcccagtatt ttgcattgcc 60 

attaccgcta ccaggtttcg ttttacgata aatggcttct gcaaccgcat taacccttgt 120 

ctgtgtcatt tcatagtaac ccttcttagc tgctgcaatc gcaagagc 168 

SeqID 196 

agaatagaga aggctatcgt ccctaaaccg gccattagtg ttctgatgat aatctcaagg 60 

atggtattag gaacaccaga aaaagaacct gtcatttgtt ctgcaaacgg aataatttca 120 

attgtaatag ccacacctgc tgatatagaa atacaaatga gtaaagcaga cataatcaaa 180 

gctaaacctg ttgaaaaatt attctgtgaa aattctctca ccgaattaac aaaggcagcc 240 

cctggcacca taaccattag tgcccctaat ataataatgc tacgatgctc acctaatcct 3 00 
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gaaaaatata ataaattagc acttaaagcc actacagaac tacctagaat ggtcaaaaga 3 60 

aatcctgtat gtatacgact ttccatcaaa ttcagaaaat atcctaaaat taagcctgtt 42 0 

acagcagcgg aaaatgaatc taataaggaa ctacctaagg ctaaagaaaa actacccgct 48 0 

ccacaaaaa 48 9 

SeqID 197 

ccatattctg taacatatgc tgctccagta atttgtaatg gagcggattt agcacgacga 60 

acagcatatg ctatgtcgtc atcagttaca aaaccttctg atgtagaaga attagttacc 12 0 

ataggtgata gaacaaaacg attttctagg ctcaatccat tgggaagtat taatggacga 18 0 

aataaattac cttggacatt attcatctct gctcaccgtc cccttattag ccatagactt 240 

agcctgaaat ctatcaaact ctgcaac 2 67 

SeqID 19 8 

gccaaccata atgacaaaaa taatacgata agcgttaata aatttagtgc caaa 54 
SeqID 199 

tccttctgcg tattttgttg ccattccaaa aaaggcagct acccacatcc aaaacaaagc 60 

tcctggtcct ccagatttaa tagccgtagc cacaccaacg atattacctg tccctaccgt 12 0 

tgoagcaaga gcagttgcaa gggcagcata act 153 

SeqID 20 0 

acattaggca tttcttttaa aataagaggg tgccatttct ccgcaaaacc ttttctaggg 60 

ggtaaatctc ctgattttcc tttgcctggg 9 0 

SeqID 201 

tttaatattt ctaatagttc cttaggtttt ggtacttctg ctaaatcagc aagtacttcc 60 

tcagctaatt cttccttaat aatttcttgt gataaggcca cacattcatt acaaatgaaa 12 0 

acaccattac ctgcaataat tttttttact tcatcttggc ttttgccaca aaatgaacaa 18 0 

SeqID 202 

ggtgtttata aaaagcagag cctaagcctg attttccatc agattcacct gattgtttat 6 0 

cattttgatt ttttgcatga taggtttgtg ccttattafcc gagaa 105 

SeqID 203 

attttcttac gatcagtatg tggaaattca cgacaagctt taggtctgac atcgtatata 6 0 

ctacataagt tatcatcgcc taaaaacgga caaggcatcg ctttaaatac tttatctcca 12 0 

tcttcatcta cttctaaaaa taaatcttca aaagccgata acttcatccg gaaatgttta 18 0 

gaaatcctag aaatatcagc ttctgtaaat agaggaccca aagttttaca acaattagca 240 

catgctgtac aatcaatttc t 261 

SeqID 204 

agtagaataa gtacaataac aataccaata aggataagcc caactttttt atatagagag 60 

gacttttctt caggacttaa aggatcgttt ggtctgagac tctcttcagt caaatatttt 12 0 

tttccatcta aggtatattg gaggagacca aagaacatac caatagcagc aagtgagaaa 180 

ccaagatgga aatttacttc ttgtccgagg taccccacta caatgggtga aataaaagca 24 0 

ccaaggttaa taccaaaaac aaagatacta aaaccagcat cacgacgaga atcattttct 300 

ccatataaat taccaaccat ctctgataca tttggtttta atagtccagt accaaggata 360 

attagagcaa tcgaaatgaa tagagctgtt 3 90 

SeqID 205 

tctfcttcctc gttcaaataa tttatgtgct tttaaagata agttcaaatg tcgcacaagt 6 0 

tctcgattag actgttta 78 

SeqID 2 06 

atccattcat ttcatagaag gaaattaaat catcatggca agttaaactt ataccatctc 60 

gttcttgaga aacaaccagg tctttcatcg ctgctaataa tgctgtacca atcccctgac 12 0 

ctt 123 

SeqID 2 07 

ctgtcttttt cgatagtatc ccatttatta actacaataa taataccttt accagcttca 60 

tgtgcaaacc ctgcaatacg cttgtcatac tcacgaatac cttcctctgc attaatcacc 120 

ataagaacaa catctgagcg gtcgatggcg cgcatagaac gcataacaga atatttctct 18 0 

gtgttttcat ataccttacc agatttacgc ataccagcag tatcaatcat agtatactct 24 0 

tgcccctggc tatcaacgaa attagtatca atagcatcgc gcgtagtacc agcaactggg 3 00 

ctagcaataa cacggtcttc cccaaggatg gcattaatta aactagattt cccaacattt 3 60 
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ggacgtccaa tcaaactaaa acgaataata tctggatttt cattttcctc ttctaccgga 
agattttcca caatagcatc caatatatca ccagtaccaa tacca 

SeqID 20 8 ^ 
tgcccatcct cagatatagc tggaattaat tttaccaaaa atagcggaaa aatagataaa 

caaaccgtta ca 
SeqID 209 

ccctttctca gaaatcattt ccataacaac gagaccagca ccaaattctt tagcgatagt 
acgaaaagca gaattggtaa tacctgccat cggtgc 

SeqID 210 

gcgtcgacag gatcagcagc gaatccacag actacgccag gaaaactatt acaagcaagc 
atcgctccaa ttcctgtccc acagcctgta ataacaaaat ctgcagcacc gctattaagt 
aaaatagcag tcagtaaacc attttgtaca 

SeqID 211 

gcaatgccta aacgttgctt cataccaccg gagtatgatg aaatcttctt tttacctaca 
tcgcctaaac caacaagctc tagtaatcta ggaatttctt gtttagcttt ctttttatca 
acacctttta aggtagcgat a 

SeqID 212 

cttttactga ttttaaaaaa tccagatact acacataaca ccaccataaa aagacaacct 

aacaaacccc ctaaagtttt acgaaaaaga actgaaaaaa agagtattaa gcttatgaaa 

agaataccga taaaccafcgt tgcaccg 

SeqID 213 

aatttcaaag cacgaccacc aggtgttgtt tcagggttac caaacataac gcctactttt 
tcccttaact ggttgataaa aattgcgatt gtctttgtct tgttaatgga tgctgacaat 
ttacgcattg ct 

SeqID 214 

aaaagagtag ggatttgcca catatatcat gtatcccaat ccagctgttt ccacaacaat 
aaatttagcc gttattttac tcaattttcc tttaatataa tcgtacattc tattttccta 
tctcaatatt tttcattttg g 

SeqID 215 

aafctcaacga caagtgcttc attgatttct gggttaattt catcgcgttc tggaaggcga 
gtcaatgaac cttcaagttt ttcagcatca aatgatacga aagctggacg tcctaaagta 
gcttcaacag cttcaaggat tgcaggtact ttcattgatt tttcacgaac tgaaataact 
tgtcctggag taacgcggta tgatgggata tcaacacgtt taccatcaac aaggata 

SeqID 216 

acaagaccct tagtagaaat cttagttgat aaactagcgc catataagga agctaacttt 
tctcgaaatt cttgtacctt aggatactta gcatttgcag taactaacat t 



420 
465 



60 
72 



60 
96 



60 
120 
150 



60 
120 
141 



60 
120 
147 



60 
120 
132 



60 
120 
141 



60 
120 
180 
237 



60 
111 



SeqID 217 

agaactcgaa 

gaaataatca 

atgccacgct 

gaaactatta 

atatgcttga 

cgactcaatt 

aatgaagccc 

gccttagtca 

atcattgacc 

gaactcgtta 

cattcagatc 

ggaatcacac 

acgtatcgtg 

ctagctctta 

ttcaggtttt 



atgcaattag 
ctgctaataa 
caagttatta 
aaagaatttt 
ataacgaagg 
tggtttctgt 
ctattcccaa 
ctgacttaac 
tatacaaccg 
agcaagccat 
gtggcaaaga 
gttcgcttag 
ctttcaaaat 
agactgaaaa 
taccaatgct 



atatcctaaa 
ggataaatac 
ctatcaagcc 
tctcgatagc 
tatcacactt 
ttatcagaaa 
ccacttagac 
ctatgttcgt 
tgaaatcatc 
acaaagcatc 
gtttgataat 
tcaggctggt 
tgaatttgtt 
ggcaacactt 



gcaagcagcg 
agcatttcaa 
gtggagtcag 
gagtctagat 
tcacgtcgtc 
gccaccttca 
aggcaattta 
gtaggcaatc 
ggcctgtctc 
ccttacgccc 
cagttaattg 
tgtccttatg 
tatcaagaaa 
ttctgtacaa 



gtgattatgg 
agatgtgtcg 
tatctaaaac 
acggatccag 
ggattcgacg 
aaccacattc 
agcaagaaag 
gttgggctta 
ttggttggca 
tgaccaaagt 
atgaaatatt 
acaatgccgt 
cctttcaatc 
catttataaa 



cacaaaaagg 
ctggctgaat 
ggagtttgaa 
aaaaatcaaa 
cattatgaag 
tagaggcaag 
accactacaa 
tgtttgcctc 
caagaccgct 
caagatgttc 
ggaagccttt 
agctgaaagt 
gctggaagaa 
gtgttgcctt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
72 0 
780 
840 
861 
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MKKVXiTFLLCSL YFVS I PAI STEEPLTLSQNRRYAIjTQTVVDKEMYFDAIPERPTTKIEIS SFQDEALTITGETIiVP 

NTLLSIVSLTINSNGIPVFTLSNGQFIJ^SREAIFNDLVSKQQSV 

RVHLVEQAETEHGIYYKTDSGFWISVFJDLSVAD]^^ 

I AKLATLYAS QEQVKLGKLSLD SKFEYKDNV1SFQFPNS YDP SGS GKLEKKADHKLYTVKELLE ATAKESD1WATNMLG 
YYVNNQYD SMFQTQVDTI SGMHWDMKKRQI S PQAAGKMMEAI YYQNGD I VINYLS KTDFDNTRI PKNI PVKVAHKI GD 
AYD YKHDAAI VYAEQPFIMI I FTDKS SYDDITKIADDVYQVLK 

SeqID 219 

MKKRI L S AVLVS GVTLGTAAVTVNADDFD SKI AATD SVINTLSGQQAAAQNQVTAIKGQVGALE SQQS ELEAQNAQL 

EAVSQQLGQEIQTLSNKIVARNESLKKQVRS^ 

KAALEAKQIENaWAINTVAAl^QAIEimK^ 

QAAAE AKAQAEAKAQAE S VAKAQAAAQVES ATAPTETVQTQPRTEIKP SNLTAT S S ATTVATTTATATNE PKVTQP S 
WTKAVEAPKAWS STPRAVS KPWRS YD S YPMGQCTWGAKSMAS WVGNYWGNANQWGAS ARAAG YS VGTTPRV 
GAVAV W P YD GGG YGHVAVVT S VANNS S I QVME SNYAGNMS I G3NTYRGS FNP S AS G S VY Y I YPN 

SeqID 220 

MNKRIFVEKKADFDIKSASLVKEL^ 

LDKVAFFAIEALPGQFDQRAASSQESLLLLGSDSQVKTOTTAQLYLVN^ 

EEQAFSVSDKTIPSLDFFETYKADDFAAYKAEQGLAMEVDDLLFIQDYFKSIGRVPTETELKVL 

ETELKNIDFSASKFQKQLQATYDKYIAMRDELGRSEKPQTL^MATIFGRYERANGRLDDMEVSDEINACSVEIETO 

VDGVKEPWLLMFKNETHNHPTEI EPFGGAATCI GGAIRDPLS GRS YVYQAMRI S GAGD ITTP IAETRAGKLPQQVI S 

KTAAHGYS S YGNQI GLATT YVREYFHPGFVAKRMELGAVVGAAPKENVVREKPEAGDVVILL 

SKVQTVESVETAGAEVQKGNAIEERKIQRLFRNGI^^ 

GTEIAISESQERMSVVVRPSDVDTFIAACNKENIDAVWAT^ 

VVDKDLTVPEVRTTSAETLEADTLK^SDL^ 

TTTASVMAQGYNPYIAEWSPYHGAAYAVIEATARLVAT 

IQLGLPSIGGIQDSMSGTFEELTVPPTLVAFGVTTADSRK^^ 

TIQAQHKITAASAVKYGGVLESLALMTFGNRIGASVEIAELDSSLTAQLGGFVFTSAEEIADSVKIGQTQAAFTLTV 

NGISTDLAGASLLSVFEGKLEEVYPTEFEQADALEEVPAVVSDT^ 

ASWIiVAFVTLNEAAIADSVDTMVANIAKANIIFFAGGFSAM 

I CNGFQALVKSGLLPYGNFEEAGETS PTLFYiroANQHVAKMVETRI ANTNS PWLVGVEVGDI HAI PVSHGEGKF WS 
ASEFAELRDNGQIWSQYVDFDGQPSMDSKYlSrPNGSWAIEGITSKNGQIIGKMGHSERWEDGLFQNIPGNKDQALFA 
SAVKYFTGK 

SeqID 221 

MKMNKKVLLTSTMAAS LLS VAS VQAQETDTT WTARTVS EVKADLVKQDNKS S YTVKYGDTL S VI SEAMS IDMNVLAK 
INNIADINLIYPETTLTVTYDQKSHTATSM 

I VSPMKTYS SAPALKSKEVLAQEQAVSQAAANEQVS PAPVKS ITSEVPAAKEEVKPTQTS VSQSTTVSPAS VAAETP 
APVAKVAPVRTVAAPRVASVKVVTPKVETGAS PEHVS APAVPVTTT S PATD S KLQATEVKS VPVAQKAPTATPVAQP 
AS TTNAVAAHPENAGLQPHVAAYKEKVAS T YGVNEFST YRAGD PGDHGKGLAVD F I VGTNQALGNKVAQ YS TQNMAA 
ISTNIS YVIWQQKFYSNTNS I YGPANTWNAMPDRGGVTANHYDHVHVS FNK 

SeqID 222 

MLKHFGSKVRNLRWRNITREDFCGDETELSTOQLARIESGQSIPNLTKAHYIA^ 
ELKY-KLLRTPTYGDANRLAWEAYFDEIYEVFYEELPEDERL 
EYTT^FVMIDLYLLCFSIISTCGMKSLYSLElSre^ 
HRSNAIMTEIHDFQIOIPILSLIEWKYLLIIEKDRTKAET^ 

SeqID 223 

mtektkavettdvaiaidtlvqngl 

lfacehvvlsnsnyirhtktvgvi eeddvtgltl i aepvgwcgi tpttnptstai fks lis lktrnpi i fafhp s aqe s s 

ahaariwdaaiaagapencvqwieqpsidatnalmnhdg 

sanirqaahdivmsksfdngmvcaseqaviidkeiykefveefksyhtyfvnk3cekal 

lnpnivgksawiaeqagftvpegtnilaaectevsekepltrek^ 

aaihtkdadlarefgtrirairviwnspstfggigdvynaflpslt^ 

nmqwfkvpsktyferdsiqylqkcrdver^ 

gtdli^tfkpdtiialgggspmdaakviwlfyeqpevdfhd 

vtpfavisdkaisnsjrkypiadysltpw^ 

vfdylers vkdadfearekmhnas tmagmafanaflgi s hsmahki gaqfhtvhgrtnai llp yviryngtrpakta 

twpkyj^yradekyqdiakllglpaatpeeavesyakavydlgtrlgik™ 

c s panprl pmvdhmqe 1 1 eday ygyeerp grrk 

SeqID 224 

VGIKVYKPTTNGRRNMTSLDFAEITTNTPEKSLLV^ 
VKTIEYDPNRTANIALVHYTDGVKAYII^ 



WO 2004/099242 PCT/EP2004/004856 

71/109 

IRAAGASAQVLGQEGKYVLVRLQSGEVRMIL^ 

HGGGEGKAPVGRKAPSTPWGKPALGLKTR2STKKAKSDKL IVRRRNQK 



miefekpiitkidenkdygrfvieplergygttlgn^^ 

LNVKGIjAVKS YVEDEKI X ELDVEGPAEITAGDI LTD SDIE I VNPDHYLFTI AEGHS LKATMTVAKNRGYVPAEGNKK 
DDAPVGTLAVDS I YTPVKKVE^QVEPARVGSITOGFDKLTIEIMTNGTI I PEDALGLSARVLIEHLNLFTDLTEVAKA 
TEVI^ETEKVNDEKVLDRTIEELDL 
KNDK 

SeqID 22 6 

MKKIRLSKFIKMI WILFLI SVAAS FYFFHVAQVRDDKS FI SHGQRKPGNSLYAYDKS FDKLLKQKIEMTMQNIKQV 
AWWPAAKKTHKTAWVHGFANSKENMKAYGWIiFHKliGYNVLM 

NP S S Q I TLFGVSMGGAT WIMASGEKLPSQVVNI IED CGYS SVWDELKFQAKEMYGLPAFPIiLYEVSTI S KI RAGF S Y 
GQAS SVEQLKK3SENLPALFIHGDKDNFV^ 

SeqID 227 

MSKVRGFELVSQFSNKELLPKRETAHAAGYDLKVAKKTVIEPGEITLVPTGIKAHMQPGEVL^ 
LINSVGVIDGDYYTSINQVNEGHIFAQMQNITDQAVI^ 

SeqID 22 8 

MKFGKKIjGFLALLMSIVLILGACGKTGLGNSTGN 

S I. KVADAQDDAARQADDVQNFI S QNVDAILINPVD S KAI VTAI KS ANNANI PVI LMDRGSEGGKVLTT VAS DNVAAG 
KMAADYAWKLGKKAKAFELSGVPGASATVDRGKGFHSVAK^ 

AQNDEr^GAAQAVKSAGLQNVLIVGIDGQPDAHDAIKKGDISATIAQQPAKMGEIAIQAAIDHYKGKKVSKETISP 
IYLVTKDNVEKYNW 

SeqID 22 9 

MGKEKLI^YSGGLDTSVAIAWLKOTYDVIAVCMDVGEGKDLDF 
HAMYEQKYPLVSALSRPIIAQKLVEMAHQTGATTIAHGCTGKGNDQWFE 
TFAKANGVPI PADLDNPYS IDQNLWGRANE CGV^ 

EMTLADLILSLNEIAGKHGIGRIDHVENRLVGIKSREIYECPAAMVLiLAAHKEIEDLTIiW 

IYX*ALWFSPATKAIIAWKETQK^A7NGTTK^^ 

PTQVNAQVNKG 

SeqID 230 

MAKVCYFTGRKTVSGNNRS HAMNQTKRTVKPNLQKVTVIjIDGKPKKVWVS ARALKS GKVERV 
SeqID 231 

MAKQKNNWRRVGVGVLTIiASVATLAAC 
DGKTRPDLATKOTVSKDGLTYTATLRKGLK^ 

EKDLNKLGVKAEGDDKWITLSSPSPQFIYYI^FTNFMPQKQEWEKYGKDYATTSKNTVY 

TLKKISJKISnirWDAKNVKTKETOIQTOKKPDTAVQMYKRGELDAAW 

SVKGLDISTVT^IRRALNLATNRK 

SGLTKLKLTITADADVPAAKNSVDYIKSTWEAALPGLTVEEKFVTFKQRLEDSRKQNFDIWSW 

LFKSDSQNNDGKFANKDYDAAYNKAISEDALKPEESAKDYKEAEKILF 

GLSIDFTHAYKK 

SeqID 232 

MELLKTPIFGICFSLILYTIGQHLFKKSKGFFLLQPLFFAMVSGIVILWLMSKGLGTDVKTFYTQAYKPGGDLIFWF 
LNPAT I AFAVPLYKKND WKKYWVE I LS S LVI GMI VSLMLI VAI SKMVGLSQVGIASMLPQAATTAI ALPI TAAI GG 
NTAVTAMAC I LNAVT I YALGKKLVS FFHLNDS KI GAGLGLGTS GHTVGAAFALELGELQGAMAAIAVVVIGLVVDLV 
IPIFSHLIGLL 

SeqID 233 

OTKYLKYISFVALFLASIFLVACQNQNSQTKERTRKQRPK^ 

SPELDIKGELAXKYKISKIDGLTWSFDLNDDFKFSNGEPVTADDWFTYDMLK^ 

LTEAHSTFTAQLTEIPIVPKKHYITOKYKSNPIGSGPYMVKEYKAGEQAIFVRNPYWHGKKPYF^ 

AALESGDVDMIYATPELASKKVKGTRLLDIAS^^ 

QKOTjDTVLNGYGKPAYSIIDRTPFWNPKTAIKDN^^ 

NLAVE VAEQAKALGIT I KLKASNWDEMATKSHD SALL YAGGRHHAQQF YESHYPS LAGKGWTNI TFYTfllSTPTVTKYLD 

KAMTSPDLDKANKYWKIAQWDGKTGASTLGDLPNWLVSLNHTYIGDKRI3WGKQGVH 

SAK 

SeqID 234 
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MGQEPI IEYQNINKVYGENVAVEDINLKIYPGD WC FXEi 

LRRRIGYVIQNIGLMPHMTIYENIVLVPKLL 

AADQDIILMDEPFGALDPITREGIQDLVKSLQEEMGKT^ 

ATSFVEQMIGEERLLHAQADITFVK^ 

KKDRLVSDILKHTDFYVMEDDLLRNTAERIL^ 

SeqXD 235 

MKIDKKEFLALIASIIIiLIFASVTFFLFKDHGTTQMDTVESSVNHVSDSQLT 

NKL SNS S KKEALQKRFKKAKI)KYIjKBEADKKATKDATDIjVEI LEQAP S EE^JVXjKAEAAVNKLTVKE S KEALQKRIDT 
VKTQYGLIGNQT PS S SVAETTEQGTANPAS QDT S S YVHQ1WAPT YEQPQTJOTTPVTPGVl^TVPT PGTGTAPATNGT 

GVAQ 

SeqID 23 6 

MTKDLLLELGLEELPAYVVTPSEKQLGQK^^ 
IALDAEGNFSKAAQGFVRGKGLSVDDIEFREVKGEET 

YIRPVHTLWLLDDQALELDFLDIHSGRISRGHRFLGSDTEIIiSASSYEDDLRQQWIADAKERQQMIVDQIHAIEE 

KKNISVEIDEDLLNEVIjNLVEYPT^^ 

EJSTVTKGNEKVXjVARIjEDGEFFWQEDQK^ 

LRAASIYKFDIiljTGMVDEFDELQGIMGEKYALLAGEQPAVAAAIREHYMPTSADGELPETRVGAIIA 
FFSVGLIPSGSISTOPYALRRATQGIVRILEAFGWDIPL^^ 

KDIREAVLESDTYIVSLILEASQALVQKSKDAQYKVSIESLSRAFNLAEKVTHSVSVDY 
ELTEDMHDNLDKL FAL SPIINDF FDNTMVOTDDEKMKQNRLALLNS LVAKART VAAFNLLNTK 

SeqID 237 

MTFDTIDQIiAWTTOTLSIDAIQAANSGHPGLPMGAAPMAYVLWNKFLN^ 

LLHI^GYDLSIDDLKQFRQWGSKTPGHPEVNHTO 

TLHGDGCLMEGVSQEAASLAGHLKLGKLVLLY 

I EAAKAETDKPT 1 1 EVKTI I GFGAEKQGT S SVHGAPLGAEGITFAKKAYGWE YPD FTVPAE WARFASDLQARGAKA 
EEAWNDLFAKYEVEYPELAAEYKEAFAGQAETV^^ 

MVAAETDFQASNYAGRNIWFGWEFAMAAAMNGIALHGGTRWGGTFFVFSNYLLP 
AVGEDGPTHEPIEQIASVRSMPNL3WIRPADGNETNAAWQRAVSETDRPTMLVLTR 
YILSEAKGELDGIIIATGSEVKLALDTQDKLESEGIHV^ 
GWGKYVGI/NGLTLTIDTWGASAPG1SIRIFEEYGFTVENAVSDYKEL 

SeqID 23 8 

MTLQDQIIKELGVKPVINPSQEIRRSVEFLKDYLLKH^ 

I AIRLP YGIQADEEDAQKALDF I KPDI ALT INIKEAVDGQVRALNAAGVE I TDFNKGNIKARQRMI S QYAVAGQYAG 

AVIGTDHAAENITGFFTKFGDGGADLLPLFRLNKSQGK^^ 

EIDAYLEGKWSDKSRGIIENWWYKGQHKRHLPITIFDDFWK 

SeqID 23 9 

IKKESVIKLLKYAFGIIMGFIILAIVIGGLLFAYYV^ 

SIPLNLWAITSIEDKRFFKHRGVDIYRILGAAWIINLVSSNTQGGSTLDQQLIKIiAYFSTNKSDQTLKRKSQEW 
LQMERKYTKEE ILTF YINKVYMGNGNYGMRTTAECS YFGKDLKEL S I AQIiALLAGI PQAPTQYDP YKNPESAQTRRNT 
VLQQl^QDKKISKKEYDQAVATPVTDGLKELKQKSTYPKYMDNYLKQVISEVKQKTGKDIF 

QL YDI YNSDT YIAYPNNELQIAS TIMDATNGKVIAQIiGGRHQNENI SFGTNQS VLTDRDWGSTMKPI S AYAPAIDS G 
WNSTGQSLISTOSVYYWPGTSTQLYDWDRQY 

NAI S SISTNS SSDAKYGAS SEKMAAAYSAFANGGTYYKPQYVNKIEFSDGT3TO 
TFGTGTKAAIPGVAQAGKTGTSISTCTEDE:!^ 

QLDIATEVYRAMMSYLTGGYSADV^MPEGLYRSGSYLYINGTTTTGTYSSSVYKNIYQNSGQSSQSSSSTSSEKQKE 
DKNTANDANS S SPQVETPNNGNATTPISHtfSNQTVPGTGHGN^ 

SeqID 240 

MLDLKRIRTDFDWAKKLATRGVDQET^ 
TLSADIKAIDAELADVDANLQSMVTV^ 

GAKVTGSRFLFYKGLGARLERAIYSFMLDEHAKEGYTEVIPPYMVmDSMFGTG 
EVPLTJNTnfRDEIIDGKELPIYFTAMSPSFRSEAGSAGRDTO^ 

LQKIaNLPYRVITLCTGDMGFSAAKTYDLEVWI PAQNTYREI S SCSNTEDFQARRAQIRYRDEVDGKVRLLHTLNGSG 
LAVGRTVAAILENYQNEDGSVTIPEVLRPYMGNIDIIKPN 

SeqID 241 

OTISNQEMLTPLRGKSGKAYIGTYPNGERVFWYNOT 
KEDMGSKQIIHILLRLHKSRPLWQLLQ 

IVHGDIKHSNWVITTSGLIYLVDWDSVRLTDRMYDVAYILSHYIPQKHW 
SQIIKCFDKRDMEHVNQEIYELRKFRELIKKHNAS 



WO 2004/099242 



73/109 



PCT/EP2004/004856 



MKISQYK^SIRRLKVGAASVMIASGSIVALGQSHI^ 

GVNQTI P YEAFGGDG^LTRXiILKS S EGAKWSDNGVDKNS PLLPLKGIiTKGKYFYQVS LNGKTTGKEGQALLDQIKAN 
QMSSQAKS1TOKAGKKMSMMSKNL.PNTGETKQQ1WGVLGMLSLAFATGLTALGLKKSKQR 

^g^FDYlKVLGQFVGEHELDYLQPQVSAADAFLRQGTGPGSDFLGWMDLPEl^KEEPSRIQKAAEKIKSDSEVL 

WIGI^X^KAAIDFLNNHFANLQT^ 

ArcVFKELLVKKYGQEEAT^^ 

EGMJAARiSlSSDKISENIA^ 

TDLHS LGQF IQEGYRHLFETWRVEKPRKNVTI PELTEDIiDGLGYLQGKDVDFVNKKATDGVLLAHTDGGVPNMFVT 
IiPTQDAYTLGYTIYFFELAIGLSGYLNSVNPFDQPGVEAYKRlIMFAIjLGKPGFEEIjSAELNARL 

SecrlD 244 

MKEKQTAGRRQLEEFAPEFARYNDDILFGEVWAKEDHLTDKTRSIITISALISGGNLEQLEHHLQFAKQNGVTKEEI 
ADIITHLAFYVGWPKAWSAFNKAKEIWI 

SecrlD 245 

AWR^EDAQKSGKVKSIGVSNFLEKDLENII^GHVKPAWQII^IGNTPFDLIDYCQSKGIQVEAYSPIAHGO^ 
KSDGIQKMAEKYGVSVAQLCIQYLLQLNLIVLPKASSKEHLQSNLDFDFVISDEDMSILKSLMFDDYGEFSNFPVFS 

EK 

MFRRSK^SYDTSQTKQRFSI^^ 

NAYIDLYDVKNGLIDPQNLIVLNPSSYSANYYIKQGAKYYSNPSEITTTGSATITFNILDETGNPHKKADGQIDIVS 

VNLTIYDSTALRNRIDEVIITOANDPKWSDGSRD 

IKYSPEAKHRTVEQHAELDAIOJSIANTDEIJSNSTY^ 

KTDOTAFTLTTOPECMTVKVD^ 

TLADKGAITFTAI03KAGNQAEPKTVTIlSrVLPPDKIKYSPE^ 
PDTSTSGEKDGIVEVHYPDGTVDDVNVKVTVTSKKTDOTAPTLTVTPEQQTV^ 

KAKYEISTOIIGARVKIKYLTKEPNKKVMEVTIMK^ 

RTVEQHAELDAKDSIANTDEIjPSWSTYNWKNGHKPDTSTPGEKNAVVVVTYPDKSTDEVPVKVTVVDPRTDAEKNDP 
AGKDQTVKVGEQPDPTKSLEAVPAGSTVAYKEPVDTKTPGEKNAIVVVTYPDKSTDEVPVKVTVVDPRTDAEKNDPA 

GKBQTVIWGEQPDPTKSLEAVPAGSTVAYKEPVDTCT^ 

KDQTVKVGEQPDPTKSLEAVPAGSTVAYKEPVDTKTPGEKNAVV^ 

DQTVKVGEQPDPTKSLEAVPAGSTVAYKEPVDTKTPGEI^^ 

TTVPQGTPISDEEITGLWIPEGSNGVPKWGDRPNTDVPGDYKVTVEVTYPDGTKDTVAWVHVTPKPVPDKDKYD 
PTGKS QQVNGKGNKLPATGES ATP FFNVAALTI I S SVGLLS VS KKKED 

SeqID 247 

MQI LED YDGRAL PKLETDRIiILRQRTVGDVPAMFDYVCLEEVAYPAGLS P IAS LEDEYD YFENRYYQNLEKAKLP S G 
YGI TVKGSDRI I GS CAFNHRREDDVFE IGYLLHPDYWGHGYMTEAVAAL IEVGFTLLNLHKI EI RCYD YNKQ S QRVA 
EKLGFTLEATIRDRKDNQGNRCVNLIYGLLRSEWE 

SeqID 248 

MSKELSPKYNPAEVEEGRYQTWLDQDVFKPSGDTEAKPYS IVIPPPNVTGKLHLGHAWDTTLQDI I IRQKRMQGFDT 
LWLPGMDHAGIATQAKVEERLREQGISRYDLGREKFLD^E 

AVRKVFVDLYNKGWI YRGE F I INWDPAARTAL SD IEVIHKDVEGAFYH^I^^^LEDGSRALEVATTRPETMFGDVAVA 
VNPEDPRYKDLIGQNVILPIINKPIPIIM 

defngitorfear^wak:^ 

PPRF1STDTFMSWMENVHDWVI SRQLWWGHQI PAWYNVNGEJVTYVGEDAPEGDGWTQDEDVLDTWFS SALWPFSTMGWPD 

TEAADFKRYFPTSTLOTGYDIIFFOTSRMIFQSLEFT^^ 

ADALRWFLSNGSAPGQDVRFSYEKMDASWNFI]^ 

LNEWGKOTESFDKFEFGVAGHILYNFIWEEFANWYVELTKEVLYSDNEDEK 
TEEIFGQYAEGSIVLASYPQOTATFENQTA^ 

SNYIKRFTNPETLEI S SAITAPELAMTS I ITGAEIFLPLADLLlWEEEIiARLEKEIxAKWQKELN^ 
NAKPEWQKEKDKQTDYQTKYDATIARIEEMKKLNND 

SeqID 24 9 
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MVEPIISIQGLHKSFGKNEVLKGIDLDIHQGEVWIIGPS 
KMREKMGMVFQQFNLFPMCTVLENITLSPIKTKG 
LAMNPDVLLFDEPTSAXjDPEMVGEVLTVMQDIa^^ 
KEIRTRDFLSKVL 

SeqID 250 

MNYKE I YQEWLENDS LGKDIKSDLEAI KGDE S EIQDRFY KTLEFGTAGIjRGKIjGAGTNRMNTyMVGKAAQAIjANTI I 

DHGPEAIARGIAVSYDVRYQSKEFAELTCSIMAANGIKSYIYKGIRPTPM 

YKAYWKEGSQILDDIMQIANHMDAIT^^ 

VYTPIiNGVGNLPVREVLRRRGFElSnTrTVPEQEMP 

ALEV1HDSKGEYIFLNGNKI GALLS YYIFSQRC^^ 

ANEYD I S KDKT YLFGYEE S I GFCYGTFVRDKDAVS ASMMVVEMTAYYKERGQTLLDVLQT I YDEFGYYNERQFS LEL 
EGAEGQERISRIMEDFRQDPILQVGEMRLENSIDFKDGYKDFPKQNCLKYYFNEGSWYALRPSGTEPKIKCYLYTIG 
CTEADSLSKLNAIESACRAKMNSTK 

SeqID 251 

MYREITAVEHDREVSESNQTNLLQ^ 

NLDIVTKVLKDLiKAFGKKQRALFIKCDPLIYL 

FQANLYANQFGLDKMSKKTRQAIRTSKNKG^ 

ITMASLDVAKRLEKIEKECQIAQSERIKSLELNREKK^^ 

GNTSENIYAGMDDYFKSYSAPIYTWFETAQRAFERGNIW 

LLYKASNYVYALRKKRNS 

SeqID 2 52 

MACTTILVGKKASYDGSTMIARTEDSVNGDFTPKKLKVM^ 

IWGEAGINSKNVAMSATETITTNSRVLGADPLVSDGIGEEDILTLVLPYIQSAREGVERLGAILEKYGTYESNGIAF 
SDTEEIWWLETIGGHHWIARRVPDDVYVTNPNQLGIDHFEFN^ 

GS QRDKDRH YNTPRS WAMQRFLNPEIEQDPRS LFI PWCQKP YRKITVED I KYVLSDHYQDS VYDP YGPEGDAVSRRA 

FRSVGINRTSQTSILQLRPNKSLETTGVQWLSYGSMPFATMVPLFTQA^ 

ADPHFYQHEADIESYIERTMAQGHAHINGVDREVAENKEI 

MGD 

SeqID 253 

MRKKFLLLMSFVAMFAAWQLVQVKQWADSKLKV^ 

IQDSNAFVYI^DNMETWAPKVAKSVKSI^^ 

VVENIRNKFVKAYPKDAASFNKNADAYI^ 

AE P S S KRLGELAKY I KKYN INY I Y FE ENASNKVAKT LAD E VGVKTAVL S P L E GL S KKEMAAGED YF S VMRRNLKVLK 
KTTDVAGKEVAPEEDKTKTVETGYFKTKD^ 

TAGYKTDVES I KIDGKKHQMTFVRNGKS QT FTYKYAGYKI LTYKKGNRGVRYLFEAKEKDAGQFKYI QFSDHGI KPN 
KAEHFHI FWGSESQEKLFEEMERWPTYFPAKMSGREVAQDLMSH 

SeqID 254 

MKLSKKIjLFSAAVLTMVAGSTVEPVAQFATG 
ISNYAKXiGDNVKGLQGVQFKRYKV^ 

YLYVEDLIOSTSPSNITKAYAVPFVLELPVANSTGTGFLSEI^ 

LKSTIPANLGDYEKFEITDKFADGLTYKSVGKIKIGSKTLNRDEHYTIDEPTVDNQNTLKITFKPEKFKEXAELLKG 
MTLVKNQDALDKATAm'DDAAFLEIPVASTINEKA^ 

VKKD S TETQTLGGAE FDLIjASDGTAVKWTDAL IKANTNKNYI AGE1AVTGQP I KLKSHTDGTFE I KGLAYAVDAHAEG 

TAVTYKXiKETKAPEGYVIPDKEIEFTVSQTSYN^ 

MAFAVKGMKRRTKDN 

SeqID 255 

MKKIlQKIVraGLSVTLLILSQIPFGlLVQGETQDTNQ^ 

GEATFENI KPGDYTLREETAPIGYKKTDKT WKVKVADNGATI IEGMDADKAEKRKEVLNAQYPKSAI YEDTKENYPL 

VNVEGS KVGEQYKALNP INGKDGRRE I AEGWL SKKI TGV1TOLDKNKYKIELTVEGKTTVETKELNQPLDVVVLLDNS 

NSMNNERAHNfS QRALKAGEAVEKLIDKI T SNKDNRVALVTYAS TI FDGTEATVS KGVADQNGKALND S VS WDYHKTT 

FTATTHNYS YLNLTNDANEVlSriLKSRI PKEAEHINGDRTLYQFGATFTQI<ALMKANEILETQS SNARKKLI FHVTDG 

VPTMSYAINFNPYISTSYQNQFNSFLNKIPDRSGILQEDFIINGDDYQIVKGDGESFKLFSDRKVPVTGGTTQAAYR 

VPQNQLSVMSNEG YAINSGYI YL YWRD YN1WYPFDPKTKKVSATKQ IKTHGEPTTL YFNGNIRPKG YD I FTVGIGVN 

GD PGAT PLE AEKFMQ S I S S KTENYTNVDDTNKI YDELNKYFKT I VEEKHS I VDGNVTD PMGEMI E FQLKNGQS FTHD 

DYVLVGNDGS QLKNGVALGGPNSDGGI LKDVTVTYDKTS QTI KINHLNLGS GQKWLT YDVRLKD3STYI SNKFYNTNN 

RTTLSPKSEKEPNTIRDFPIPKIRDVREFPVLTISNQKKMGEVEFIKW^ 

PEGSDVTTKNDGKIYFKALQDGNYKLYEISSPDGYIEVCT 

I TNTPKRP PG VFPKTGG IGT I VYI LVGS T FMI LT I CS FRRKQL 
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SeqID 256 

GAKKAGWTE YAE^LSVREQVDHVMI PKINQDLPI YA.GPEEDNLQRGVGHLEGI SLP IGGAS THAVLSGQRGMPAARL 

FADLDKMKKGDYFYVTNLKET^^ 

GSLSFWMFKAMRILLLKFLKLRKP^^ 

SeqID 257 

MSDVVEKQTAKS FIiyOWLNGIjALGTVIVLI PGAILGELMKALLP 

NPIQSASLGIiAVMFAGGAATFLKGAIMLKGTGDI inmgitaalgvlliqflsdktks ftlivi ptvtlllvggvghv 

LLPYVKMITTMIGQGIASLLGLQPVLMSIM 

WSVNSKGTALAHVLGSPKISMANV^ 

WS VMNMIiLI III FVAAP 1 1 LNFI FNYLF IKVLKI IDPMDYKLD I 
SeqID 258 

MARPLVEQTADRLLHLILEREYPVGAKLPE^^ 

PLGFSLIKDTDRLTSDLFELRLLLEPRIAELVAYRITD^ 

GNIAMDSLkPVINQSIHLINAOTTiraQMKS 

SeqID 259 

MIKKNIO/FLGVLLVLWILGGGVLFYQSQFQKT 

EYSYHVDAKTGQILERDMDNNGFSKSTSQSSSSSSQKSH^^ 

DXDFVDLKNK^EVDYQIDAETGKIIERSRDHMND 

SeqID 26 0 

LNRKKRYRLTVKKQNASIPRRLNLLFFIIVLLFTVLILRLEQMQIGQQSFYMKKLTALTSYTVKESKARGQIFDAKG 
WLVENDERPT VAFSRGKNI SSQSI KELANKLSHYITLTEVAS SDRAKRDYYLADKANYKKWE SLPDSKRYDKFGN 
HLAESTVYANAVAAVPVSAINYSEDELKVVALFN^ 

RRKKGTSLSDILGTISTEKAGLPREEVKKYLKKGYSLNDRVGTSYLEKQYEDDLQGIRQIRKVVVNKKGKVVSDNIT 
QEGKSGRNLmTIDLNYQNK^ 

AKNFTPGSVVKGATLS SGWENKVLRGNEVLYDQEIANIRSWFTRGLTPI SAAQALEYS SNTYMVQVALRLMGQDYNT 

GDALTDRGYQEAMAKLRKTYGEYGLGVSTGLDLPESEGYTO 

RIiAPHWSDIYEGNDSNKFAQLWSITPKTLN^ 

AETFAKNVlSTGQTVSTYNLNAIAYDTlSrRKIAVAVMYPHVTTDTTKSHQXiVARDM 
SeqID 261 

MTVFPKHFLWGGAVAANQVEGAFRTDGKGLSVQDV^ 
VFRTSIAWSRIFPNGDDSAPNEAGLQFYDNLFDEIiLKYNIEPLVT^ 
VMERYKDK^YWLTFNEWS ILHM 

PAYPMTSDPRDVLAARQFEQHNLLFSDIHVRGKYPTYIQSYFKNNGIKIKFEEGDEEVLAQNTTO 

AYDFENYQSGQGNILGGLTNPHLTTSEWGWQIDPIGLRLVIjNQYYERYQIPLFIVENGLGAKDQLIETLDGDYTVED 

DYRIDYMNQHLVQVAKAIEDGVEIMGYTSWGCIDCV^ 

KTNGQSLFEHHNR 

SeqID 262 

MATFQIKEKMFSLGGKFTITDQTGLPC 

KKI)FSFLKPHYTIEDLDMEV1CGNFWDIDFQLLKDN^^ 

LEKNASN 

SeqID 263 

MKKITTLILASSIiLLVATTSVKADDNFEMPTRYVKMSEKSKAFYQ 
RISTOIYNLGILISQESKGFIQRIDNAYSLENVSDIV^ 
ELDDMYEHVYLNE SRVEAINRNIQKYN 

SeqID 264 

MKLKKFFEDLIiAKLEYRPIQVFMRHFQSAEiy^ 

NIYEPASRIiAVDAFSKPSTGILGFAjSLTAFWTMSKSIiTSLQKAINKAYGVDQHRDFVISRLV 

LIFSTFSKPVLQIIV1JMYDLGDTLTAW 

NLISQYVLNRVEKMTO 

SeqID 265 

MKLLYTDINHDMTEILVNQAAH^^ 

KES LND JGliAMI F YRALAS FEDGQLKVFGRLKQDAS FI SQL VDL YKEIiQTANLSILDLKYLHSPEKFEDLLAIFLW 
SDLLREGEYDNQSKIAFFTEQTOSGQLDVDLI<^TILIVDGFTRFSAEEEALIKSLSSRCQEIIIGAYASQKAYKANF 
TNGNIYSAGVDFLRYLATTFQTKPEFILSKWESKSGFE 
EHVARAIRQKLYQGYRYKDILVLLGDVDSYKLQLSKIFEQYDIPra 

VLNLFKTGIYGEISQDDLDYFEAYISYADIKGPKKFFTDFWGAKKFDLGRLNTIRQSLLAPLESFVKTKKQDGIKT 
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LNQFMFFLTQVGLSDNLSRLVGQMSE^^ 

ATVDVVTVKSYDLVEPHSNQFVYAL 

LFNAAKQALVLTIPQLLN^^ 

EEQTFWSVAWYLRRQLTSKGIEIPIITDSLDTVW^ 

EQD S IHPDMRHHGTYLHRVFE XLMKNQGI E S FEEKLNS AINKTNQEDVFKSLYS EDAESRYS LE I LEDIARATAT I L 
RQDSQMTVESEEERFELMIDNTIKINGIIDRIDRLSDGSLGVVDYKSSAQKFDIQKFYNGLSPQLVTYIDAISRDKE 

VEQKPPIFGAMYLHMQEPKQDLSKIKNL^^ 
QLLYKKASETIKSGKFLINPYTEDAKTVDGDQFKSITGFEAD 



MSETKVMALREAINVAMSEEMRKDEKV 
LRPIVDLTFMDFVTIAMDAIVNQGAKT^ 

NESKALLKS S ILDNNPVI FLEPKALYGKKEEWMDPDFYI PLGKGDIKREGTDLTIVSYGRMLERVMQAAEEVAEEG 

IISTTEVVDPRTLIPLDKELIIDSVKKTGKLI^ 

LEQGI L PDVAKI KD AI YKVVNKGK 

SeqID 267 

MAFDVIMPKLGVDMQEGEILEWKKNEGDTTOEGDVLLEIMSD 

EEGEEVGT S S PS AD AT I TAEDGQS VSGPAAP S QETVAAAT PKEELAADE YD I WVGGGPAGYYAAI RGAQLGGKI AI 

VEKTE FGGTCLNVGC I PTKT YLKNAE I LDGLKVAAGRGINLAS TNYAID^©KTVAFKNS VVKTLTGGVRGLIjKANKV 

EIFNGLGQWPDKSWIGDKVIKGRJSTWIiATGSKVSRINIPGIESPLVLTSDDILDLREIPK^ 

LWASYGVDOTVIEMADRIIPAMDKEVSL^ 

GRVPQI^GLENLEPELEMERGRIKVNAYQETSI^^ 

AAWTHPEVAMVGMTEEQAREQYGDILVGKNSFTGNGRAIASNEAHGFVKVIAEPKYKE 
STIMENELTVYDVAQSIHGHPTFSEVMYEAFLDVLGEAIHNPPKRK 

SeqID 26 8 

MGKYFGTDGVRGEANVELT PEIjAFKLGRFGGYVLSQHETDRPRVFVARDTRI s gemle sali agll s vgi evyklgv 
IjATPGVSYLVRTEKASAGVMISASHNPALDNGIKFFGSDGFKLDDDRELEIEALLDAKEDTLPRPSAQGLGTLVDYP 

eglrkyekfmestgidlegmkvaldtangaatasarnifldlnadisvigdqpdglnindgvgsthpeqlqslvre^ 

GSDIGI^FDGDSDRLTAVDENGEIVDGDKIMFIIGKYLSDKGQIiAQNTIVTTVMSNLGFHKALDREGIHKAITAVGD 

ryweemrksgynlggeqsghviimdynttgdgqlt^^ 
amevpaiaeiiakmeeemdgngrilwpsgtepllrv^ 

SeqID 269 

MKGQKIIAIAGLVLSCHFALTACHTQE 

SKILSHPDSGIWAHGNHSHFIFYSDLKGSKFSYLIPNAN^^ 

ANGYVVRHGDHFHYIPKASLSQQKQVQASRAVSRLGNQIQNSHYRWSSKIAGLHHPTSDGFIiFNGQGIKGTTPTGIL 

VEHHNHLHFI S FADLRKGGWGS I ADRYQPQKKVDSKKQS PS SKKPRTENTLPKDIKDKLAYLARELHLDISRIRVLK 

TLNGEIGFEYPHDDHTHVIMAKDIDLSKPIPNPHHDDEDHHKGHHHDESDHKHEEHEHTKSNKLSDEDQKKLIYLAE 

KLGLNPNQIEVLTSEDGSIIFKYPHDDHSHTIASKDIEIGKPIPDGHHDHSHAKDKVGMATLKQIGFDDEIIQDILH 

ADAPTPFPSNETNPEKMRQWIiATVTKINIGQRTNPFQRFGLSLMPNIEVLGIGFTPIWDMTPV^ 

GITDYSFLDKMPLLEGLDISQNGIKDLSFLTKYK^ 

TKLQELHLDHBnWKNL^ 

LASLDGIEESDEIVKVEAEGNQIKSLVLKNKQGSL 

TVTNLDFSHISTINF/PTSQLKLN^ 

ADS KKENPKTHDEHHDHEETDHAHVGHHHH 

SeqID 2 70 

MNKRVKIVATLGPAVEFRGGKKFGESGYWGESLDVEASAEKIAQLIKEGANVFRFNFSHGDHAEQGARm 
IAGQKVGFLLDTKGPEIRTELFEDGSDFHSYTTGTKLRVATKQGIKSTPEVIAIiNVAGGLDIFDDVEVGKQILVDDG 
KLGLWFAKDKDTREFEVVVElSnDGLIGKQKGVNIPYTKIPFPAIiAERDNADIRFGLE 
RAICEETGNGHVTCLFAKIENQQGIDNIDEIIEAADGIMIARGDM 

LETMTDKPRATRSEVSDVFNAVIDGTDATMLSGESANGKYPVESVRTMATIDKNAQTLLNEYGR 

VIASAVKDATHSMDIKLVVTITETGBF^^ 

ERVALEAGLVESGDNIVIVAGVPVGTGGTNTMRVRTVK 

SeqID 271 

MTATKQHKKVI LVGDGAVGS S YAFALVNQGI AQELGI IE I PALFDKAVGDAEDLS HALAFT S PKKI YAAT YADCADA 
DLWI TAGAPQKPGETRLDLVGKNLAINKS I VTQWE S GFNGI FLVAANPVDVLT YSTWKFSGF PKERVIGS GT S LD 
SARFRQALADKIGVDARSVHAYIMGEHGDSEFAVWSHA1WAGVQL SVRDAAYS I INK 

KGATYYGIAVALARITKAILDDENAVLPLSVYQ 
QLKDI IDEAWKNPEFQEASKN 

SeqID 2 72 
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MQDKNLVDVNLiT S EMKTS FIDYAMSVI VARALPDVRDGLKPVHRRI LYGMNELGVTPDKPHKKSARITGDVMGKYHP 
HGDSSIYEAMVRMAQXWSYRHM^^ 

LVLPARFPNLLVNGATGI AVGIVIATNI PPHNLGE S IDAVKLVMDNPDVTTRELMEVI PGPDFPTGA1VMGRSGIHRAY 
ETGKGSIVLRSRTEIETTSNGKERIVVTEFPYGWKTKVHEHIWLAQEKRIEGITAVRDESSREGW 

SANVILNNLFKLTSLQTNFSFNMIA^ 

HLDEVITIIRNSETDTIAQAELMSRFELSERQSQAILDMR 

DFVRELVSTSTHDTVLFFTNLGRVYRLKAYEIPEYGRTAKGLPIVNLLKLDEGETIQTIINARKEDVA2STKYF 

QGIVKRTSVSEFSNIRQNGLRAI^KENDELINVTi 

GDKOTGASRIWGQEVLIITEKGYGl^TEASEYPTO^^ 

RTNVANI SQTGRSTMGVKVMRLDQEAKIVTVALVEQEIKDKSNIEDTKE 

SeqID 273 

MTIFDERELKERFTHENRVSFYEFVAKYDAQMVPVMKAKGYRCIHSME 

KLGLKKNVRY S LE FMYQI AS LATMMP YEKVI KWQMMYC I VI TKPT VVKAVKI S RELLiKEKEAYRFFDED I P VDKE P 
VDMIYLEGDGVMVKAREEGLDNRN^ 

TLLITNSDGGHGYTPWFKEIAKALKVKQHEHFWDRYHVNEKIKSFFKLYPVEIi^ 
EALILMEEEMEGFN^^ 

EGQLRDLFFGSWREDYQKYQELENLSAGKIKHEQNKINKRYDLQTIiGRLRYGRHRNL 
SeqID 274 

MAYLSKLSDLDPSLMDADSEQIYIPKVLFEHM}^^ 

LCELLNQ S KTTVI S LKKRLAQYGL I EEVKAGSHQPNRI YLTDKLVP YIKG 
SeqID 275 

MTDNRFAQLKENFEKGSPKRRVPTSRPIAAQKAPESYNKKGRYPFSLHQDVRYDKLEALVAYHGAKSASDYLERLilV 
QEWEKMQRKLKNKEK 

SeqID 276 

MFS WLEALYYTL IQLAKVNRLNALFLVS VVGYLC YQGI KliVRKT IRNF FQLMKGFI GDRENI KKCI KNKKEAIiVHS W 

KHRQDIDWKSTGIODKSKQLWNLMK^^ 

MGIGFSFIAFILTSTYFVTVLRAAI^^ 

WRSAWVNFEQYRDYNHNEEGDDRFATWEIHQQYKKVPNKTETYPGEGGVPVIiHETRKNL^ 

RYLTNAERILGLLSTPSGDYYIDDSTTNLITMGITRSGKGEAHIAPIIDINSRAEIQPSLIIADPKGEHYQSSYKTM 

RRRGYDWVLSFQNMDWSM^ SLFNAI 

AMALI^RANETVRNGETDAWDTVTVRNIAKFLTD^ 

QFSKFRDMADLNFRSSDFASEE 

EYTHKTAKVTITSQAVWGKTTKQV 

EKOTKKRGIWITLDDYTKKP^^ 

NANYELALSNGRKCWRILHILDEFTNIPAIPHI^ 

VYI KSTAPATNE YF S KELGTRT ITRRRRS SNILDEANPNVS I ENPRQELLTPTQIjS KLQEGEAVI LRGVKGRDNAGR 
KITTDPIFLHEKTSLPYRYMFLQEEFDQSMALADIPVESGHRDLDLQDIAVGAQSTFNKIIDWRMALTDRMRTNGKI 

PQIxASRKQTIKALSQSQFTSPADLTQAVIAEVFDEEDDDDLFFVDDVM 
SeqID 2 77 

MNSNTKGHGF FRKS KAYGLVCAI ALAGAFTLATS QVSADQVTT 

TT T PAVNHAT TTDAQADIiANQT QAVKD VTAKAQANTQAI KDATAENAKI DAENKAE AERVAKENKE GQAAVD ARISIEA 

GQAAVDARNKAKQQAQDDQKAKIDAENKAE SQRVSQLNAQNKAKIDAENKI)AQAKANATNAQLQKDYQAKLAE IKS V 

EAYNAGVRQRJSTKDAQAKADATNAQLQKDYQAKL^ innvafdikaqakgvdnaeygns imta 

KTKPDGSFEF3SIHDMIDGWTIGYGKLTGKVNHHWANKDGSVTAFTO 

GRPIFEKAHNGNKTFAETLNKTLQLNLKYEL^ 

TPPKPVKVTPEAEKPVPEKPVEPKLVTPTL^ 

SVSVHPVLVPAANPSKAVIDEAGQSVNGKTVLPNATLDYVAKQNFSQYKGIKASAEAIAKGFA 

KS I KASNGDDVS S LLEMRHVL S KDTLDQKLQS LIKEAGI S PVGEFYMWTAKDPQAF YKAYVQKGLDITYNL S FKI KA 
NFTKGQI KNGVAQ IDFGNGYTGNI VVlSnDVTVPEVHKDI LDKEDGKS INNS TVKLGDEVT YKLEGWVVPANRGYDLFE 
YKFVDQLQHTHDLYLRDKVVAKVDVTIjKDGT^ 

IWKRI KAGDVYNTADL YVNGYKVKS EAVVTHTTEKS KPVEPQKATPKAPAKGLP STGE ASMTPLTAI GAI I LS ALG 
LAGFKKRQK 

SeqID 278 

MKQIKIITGLWATLSAWGNVYAEDITPTAPVNEPQVSSETAKTPQVTESQWSAKOT 

QKQKDQS QQNLVKATS TVTEAEKVAAEATPEVVKEAI KAWEAKEAWDAEANVVDAQKTEQKANQE VQSQAKTVDE 
NVKWADKESEVKQAEGWTTAQEAIDSKTANTNASEAEKAVTEKQTKLETAETNLTEAQKQD 

VNKQLAVTDTQTLIiKKLVTE IHNEKVS T S LENQAYFNQRDGSWAGYYGNYTFAATGCVP S S LAMVFTELARRQI T PT 
EIANYLWNNSNEFNKNYGGTSGKGLVQATKHFGWPTHLASQSAIVEA 
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RGY SNGNTYVYDPYNRANIGWYPVANJjWNEQSRDAIDTS SVGVPFFKITTQKMAQLEAQKAQVQS SLNTAKNQLAKT 

QDVLRTLEATPLKTPEAQAKLNQAKEALAL^ 

VASQVKLADLKATLATVElSnNrVX 

AKLKS LQAVQAEAQKQYHVVFEAYKAVRDAKEQAKLiAES YNHI I ARGGEAI P WDETDKITGYVDGS QKAVANEVTL 
ALTSNGAPXiES PVNKENQNVTKS S QALPHTGEAGLS ILS VLGVGLI STLGLTSLKKRRPH 

SeqID 27 9 

VRLTHVSLWMPSKRS WPLKLQLLRKG I SLLRRT 

K^PIIiVGEAGVGKSAIVEGITLAILRGQVPEPLKGLTWSLELSSLMSEDDEGFIAKFKK 

DE FHT I I GAGS QNGQALD AG1WI KP VLARGD I QL IGATTLDE FHE YI ETDRALERRMQPVMVEEPT I SQAI T 1 1 EQA 

KVXYEKFHGIQISSDAVHQAIRLSVRYLT^^ 

KGDQERLEGFKERLMNRWGQEDAIEAVVDAWIA 

IRFDMSEYKQKEDWKLIGNRATRIKGQLTEGVKQK^ 

NTIVIMTTNIGAKXIINKWEL 

L S E I ADRMKRQNLTLS YEP SLI QYLSDVGTDVKNGARPLERLMKRKVXtAP I S VKS LQLDKS KQGYNVHLWVEGRAPD 
GNHRQEQRQIHMEIEGERDNFFS 

SeqID 280 

MAEENAQQP S LRGKSRRERVEFARSRD I LDVAHELNMELFRDGKNYRWKEHDSMVITPATNQWYWF S QRQGGDVIAL 
VETIKEIGFNQAFEYLNEGTFKEFTWNQVKEPFSYYLEPYEQ 

KVGDMIEPVLVFKNLDKNGQVVGAALQGLVAAPDKYFGRGYLK^ IDLMS Y 

YEIHKDSLSDWLVSLEGLKTGTIGRHLIQLRAEMERRPLSSSWTDEILAQGLDEAVKQGYFKDGKNSHLLTLAVDN 
DVKGKQLIEELKDKSIPVIDATPPKAEGQSKMD^ 

NLGAGDS 1 1 QAFHHQLEDRRYVINQAELYVEESSNDGATGYLS IEGHVLDKDGI SDYLSDQALTDAEKVAFLETLQT 

ELPDXWDEIVNHYDKTVFEEVVVKYGLREKHADIIQEQEL 

SPGTAPQPVEPTAQPDFPTNVRLHFTTDDG1M 

GVQVLSVHFGKKNWMHLTGVT^^ 

DLSSVQKLNNLDLSQALNPEDSDLLLLFRDEGLHQVPASLMRIKG 

EYVKD SGQEMLS ILQNKHYEEALDS GQEMVQTDGF SAEDFTKVLDAVYHVGVPDDLARVPEGVLPVWQKYLEVS EEN 
QTOLEQMIDYADJOSrSLLVKDSAFYKEWKED 

HQRRQE S QLPYTKTE FDI YAPGGQLI KDNVHYAIGDETRPVS QLMGLGYRRLPGYQELAVIDNS ILS QLENKELNQE 

I ASE ANEHS LNS QE I PKEDNYPREAFT S PKQD IKKGLAQRVEE IVAEDATKI LVS S IPQVQENLS VEGNLVGT PQAD 

WRMLYTNLEDFGQDYQLEJ^WSPKRVDFLEDVQAPWTLALIRKEKKIGYLAYGSDWAKEFQIEEELEHLAAQIGDE 

KVPEGLYKQAEVEAFIASHQGNESLQEPIPTIVAEPFDYTSASAYEISEHAFQKIREYTQSPEDLLEYMDFMSKFPQ 

liSPRJWALIHEQWRGANAVATYEQWKAMGEALGIKPDDWQTK^ 

LMVKMIPVLDENGQQLK]TOKGNPKYKK^ 

SYPKAMPNRHETm^XKV^^ 

ELAHATLHNPKLEKQYKELPKGQKEFEAEMTSYLLSKHFGL 
SKMLKQVEKYTNPHQLGRGKEHGLNFPKAPTKGSSR 

SeqID 281 

lynDVS S S PNI TFMLQYTEANPQYVD YTNREEA 

RS YATEEQSKELTAI FTQEADYLQKLRLIDLKNKLE SAYQNGS LLWQGVI S FDNAFLAEQGL YDVATGQVDQKAIKA 

VMRD^PTLIQKEGLSDSAFWWGNIHLNTDNIHM 

LLKEETRSNLLRKEQILANLK^ 

SYLI\WSGSAAYQEFLKETRDFLQTYEGVYSAEKN^ 

ILRS FREAAPQIQDVQLEKNLES FSVLNQKKILEQHPEASVVKSQKAWQKLGYFVKAGEQPLE I IRPVYKS YDKHGK 
GIGRPEFVSDTVYDISQLTENIQLKSLTLKDLSLFSSNELKELVDAAKLKTNPTERERRELGTYRYALKLSILESSQ 

IQLPIRQLWTELGLVHHIQDENILTLLKGTSTT 

AKGFSELKRLYDTLQPSDDGQNQISQAVSKQLQERKVIKI^ 

RARSDEREEQEERRQAQR 

SeqID 282 

MKKNKFLLVS IVFII IFVVQPQNFQSLKNIFTQNDIASQLNI S S SPEEKNDGLGTAYQTQNEDLKSKS FDGQHQVIV 
VTSTEKAQFTAEELSMRNGSWEKYD 

HLIGFQLSGENANVKNLFTGTRALNANFiSrDDKSSMVYYEN^ 
IEDETISFDVYIFNGQPGYDIDYLTGSSEKIMITK 

SeqID 283 

VTKEIKIRSIPEKTWAQLHMISEEYEYPSFNEFMLAQLQRIVENG 
KliLAYHAKQDIVEELTTDWLRFMDDVDALAAERGAGGR 



SeqID 284 
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LLYYFIYLIKVIGNGLKLSLICGL^ 

TALSRDFKRMKTPLFLGVMYW^ 

YLGIRKFTDSLPPE:^^ 

DTIYYPIFTVHPFRNLEGKSDFYHILMKLKLSRKAAFTKNGERLLIRDF 

SSef^qlinsepigfidpftdlgefdsiqm^ 

EEQEVHNRVKNKKS KKYVHE IVTT YLKLGFRFKE I EARVSLFNTRLRRNWKRSDYVTTD3SIPEFYLKKDLQNGYCS PN 
SFLPRSMKIN 

SlKCP^^ 

DYEVQLKAANEQVEFYKNFKAQQSTKAVGESLEHYAETEFN^ 

LEFLSIMFEMKNESDDTIKKHKNEDFFKELDKDRREKSC^ 

IQLIGILRNAALNTLKYKQELALME^ 

TT S ENQIiRIiAI^KXjDDVS VKKLTRKNPTMKAKFD ALKD 

SeqID 2 87 

MNHFELFKLKKAGLTNLNI^ 

SNYPLELKEIYNPPVIOiFYQGNIELLSKPKI^^ 

KNGGSSIAVIGSGLDVYYPTENKKLQEYMSYira^^ 

TCERALEEGREVFAIPGNIIDGKSDGCHHLIQEGAKCIISGKDILSEYQ 

SeqID 288 ^ 

MTERTFEDIELDLKLFQIKtiDNA^ 

LSYDNRINLVSKEIMDYENALDKLYYEKQSLMQKSNERKGG 
SeqID 28 9 

LFNKI GFRTWKSGKLWLYMGVLGS TI ILGS S PVSAMDS VGNQ S QGNVLERRQRDAENRS QGNVLERRQRDAENRS QG 

NVLERRQRDAENRSQGIWLERRQRDAENRSQGWLERRQRDVENKSQGNVLERRQRDAENKSQGNVLERRQRDAENR 

SQGNVLERRQRDVENKSQGNVLERRQRDAENRSQGNVLERRQRDVENKSQGNVLERRQRDVENK 

ENRSQGWLERRQRDVENKSQGWLERRQRDVENKSQGlWLERRQRDAENRSQGWLERRQRDAENRSQGISr^ 

RDAENKSQVGQIilGKNPLIjSKS 1 1 SRENNHS SQGDSNKQS FSKKVSQVTNVANRPMLTIMSRTI SVINKLPKTGDDQ 

NVIFKLVGFGLILLTSRCGLRRNEN 

SeqID 290 

MKRLTYYFKGYIKETIFGPLFKLLEASFELLVPIVIAKMIDETIPRGDRSSLLLQIGLIFFLAAVGWVAITAQYYS 

SKAAVGYTRQLTEDLYQKVMSLGK^ 
LTIWFLVMVVTLFIIVFVMSRI^ 

AGRLtS SLVTPLTFLVVNI TLWI IV^GNLNI ANHLLS QGMLVALINYLLQI LVELLKMTMLVT SLNQ S YI S AKRI I A 
VFERPSEIIDDKIiEPKYSNKAIiEVQEMAFSYPNSSEKA^^ 

GDID I YHQGKS PDT I SNWRTLVRVVPQNAQLFKGTIRSNLS LGLGKVSEEKLWTALE I AQASDFVKEKDGQLD APVE 
SFGRKFSGGQRQRLTIARALVQDKIPFLILDDATSAIiDYLTEARLFKAITKHFNQTNLIIVSQRINSIQNADRILLL 

DKGKQVGFDNHQSLIiAHNKVYKSIYHSQNFKEEE 
SeqID 2 91 

MKFNEQSNSQAALLGLQHIiXJUVTYAGSILVPIMIASALGYNAEQLTYLIATDIFMCGIATLL 

GCAFQS VAPL S I IGAQQGS GYMFGALI AS GI YVVLVAGI FS KVANFFPP I VTGS VI TT IGLTLI P VAMGNMGDNAKE 

PSLQSLTLSLOTIGWLLINIFAKGF^ 

ILMMCIIATVSMVESTGVYLALSDITNDKL^ 

IYFTALFLVILGLLPKFGAMAQMI PSPVLGGAMLVLFGM^^ IAAVS IAAGVGFNGTN 

LF I S LPNTLQMFLTNGI VI S TLTAWLNI I LNGLPKKL I 

SeqID 2 92 

MAOTYDLISQRIEAQRQKLIAIDIVAVASSLGLNLKQGSGGHLYWDEHDSFHIYPQTNTFRWW 

IQEELTGKKPSFRETWFLETGQFESVTVTPVVREPFKHYLAPYEHHNFDLGRQYLKEERGLSDETIDF 

SATLKKGDYFEPVIIFKSFAEDGRMIGGSLQGIVENKVQHPERGRIjKQIMKHSDGIlAGFHLDV 

LLS Y YELHKE S LQNVRLVAMDGVKKGVI SRYTADLLTDGQYS QTMPRE S IRGAIDAINQTTRI LKNNPNMI TI AVDN 

DEAGRNFIKELQEDGIPINTOLPPRKEHQSKMDWNi^ 

DDPQGSAKPVSKRDTFEQAOTSHPTFSYPLLQFSTEEAFVSN^^ 

DRQVTYMLKGDKEIKALQVSFAKDKFAHLTGIRPIGKG^ 

SQSKSFVFTDLEEVQKMRNLKASHAIQSNNRSLVVA 

GNITVLSVHSTOKYIQDGGQALKDMIK^^ 

TPDNVTSKANGTDPTNASDNEVTRQQEANKR^ 
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JSqegke^ 

TLSTLIHEMAHSEIjHNKQSLTERFDGKLTRSTKELQAESIAYVVS SHLGFDTSQESFPYLASWSKEKDGLANIjTAQL 
E IVQEEA3OTLMERIDQQLS QYQTVTLNKETQQLTKQEMKKQTHP FYQ SLAAAKT SRAQVTTQEKEAS VKKDNRPTMP 

^SQETKGHGFFRK^ 
VTTTTAVNHATTTDAQi^ 

AGQAAVDARHKAKQQAQDDQKAKIDAENKAE SQRVS QLNAQNKAKIDAENKDAQAKADATNAQIjQKDYQTKLAITIKS 
VEAYNAGVRQRNKI)AQAKADATNAQIiQKDYQAKIjALYNQALKAKAEADKQS IISnSTVAFD I KAQAKGVDNAEYGNS IMT 

DGRPIFEKAHNG^ 

PTPPKPEKVTPEAEKPVPEKPVEPKLWPVLKTYTFVKFIPREYKPVPSTPETFTPEK^TPAQ 
YKVAVHPVQIPKATPTKKVLDENGQSING^^ 

VKS I KASNGDDVS S LLEMRHVLSKDTLDQKLQS LI KEAGI S PVGEFYMWTAKDPQAFYKAYVQKGLDIT YNL S FKVK 
KE FTKGQ IKNGVAQIDFGNGYTGNI VVNDLTT PEVHKDVLDKEDGKS IKWGTVKLGDEVTYKLEGWVVPANRGYDLF 
E YKFVDHLiQHTHDLYIiKDKVVAKVAITLKDGTVI PKGTNLVQYTETVYNKETGRYELAFKAD FLAQVS RS S AFGADD 
FIWKRIKAGDVYOTMF 
VGFKKRQQ 

MNQIKI I TGLTVATIjS AVVGNVYAEDITPTAPVNEPQVS SETAKTPQVTES QVNS AKVTADQAT SDVNAQKNVVNNA 

QNQKNQAQQKLWATTTLNETQKLV^^ 

HDVATKTADVKQAQASVDTAIODALT 

AKQNLSAKDTQLSQANAEVNHHKFCT^ 

ADYLYNNTDEFNKRFGGTSGKGIISATKAFGYVVT^ 

S SNGNTYVYD PYNRD2STMGF YS VDRIWNEQSRD S IDTAGVGVPF FAIMTKNMANALTKQS QALAS QQVAQKQLNDAQA 
KATGLNAVTMQTP I AQAWL I KAQSNLKDAQKRLAEAQAS VKLANQDNV^ 

B KTKLNQLKQVLAEAS QQVAQANQDYKQAKDNLTQKTAYLTNLRNAQANL I KAQSDVAQAKDNLANKI AKLQREVAY 

LQELKTKAVDAQSQYQKVLSAYKSVLSAKASLKLAEEKARLDKKGHEAVAVVDET 

TTDVKQVSVAKASVLPSTGDVKQVSVALIiGMLLTFSGFLGIRKQSKKVIN 

SeqID 295 

MISRKVALVTGASAGFGAAIWKLVSDGYSVIGC^ 
QSIDLLVHNAGLALGLDKSYEADFEDWMTMINT^ 

SKAFVKQFSLNLRADI^GTKIRVTNLEPGLCEGTEFSTVRFKGDHKRVEKLYEGAHAIQAEDIANW 
NINRIEIMPVSQTYGPQPVYRD 

SeqID 296 

MIYLDNAATTALTPSVIEmT3WMTSlTC^^ 
KGYALANQLKGKHIITSEIEHHSVLHT^ 
LPIQEIGQLLRNHQAVFHVDAVQVFSKMELDPHSLGI^ 
ASTENIIGIAGMSQALTDATTNTLKl^ 

AGFAVSTGSACTAGTVEPSHVIiTSIiYGANSPRLNESIRISFSELNTQEEILELAKTLRKIIGD 
SeqID 2 97 

MSRKTFKHILSIGVCTLVLSMSLYYTEKAHAIAGPSDRQYVENPN^ 
SKEEILDYIARNLNSSVGGESIOWQYSNIEFKESAYLKR 

PIVTETHEVRWGTPVAISHRIYFVEESSGKVLDEYTNLHTADSELNGYRVGDYITDYALSKSAYEAFLNSRLDKEGY 
KLQHRISTlWRQNLQIDKLIFl^DFNEENIYYQIGNIRPLLSRSSAEVESDIITERYYVSKNAKSLARTESTISi™ 
VDAKTEQPLFNHTLTGYQIxATVSHVYNRLFEENLIPTTKSGERYFIQNMKKTAEQEYT^ 
ARPVDWDYHSGASGSLENQPNIYTEEDSTEFLGNKPQAACYPNKQFACENTDSKYNYSYLEK 

SeqID 2 98 

^PLIIGMNDKQAEAVQTTDGPLM 

ATQDTLIATFHSMCVRILRREADYIGYmOJFTIVDPGEQRT^ 
AYEKQAGDMYTQVIAICCYKAYQEELRRSEAMDFDDLIMMTLRLF 
KLLASRFKNICWGDADQSIYGl^GADMQNILDFE:!^ 
QNDEGEQIVYHRAlSnSIEQEEAVFVASTIDN^ 

EIRDVIAYLNILANTSDNISFERIVT^PK^GVGPGTLEKIRSFAYEQNMSLL^ 

ILTLRSKLDSLTVTEITENLLDKTGYLEALQVQRFT^ 

D3^IADTDDSATETAEVTLMTLHAAK^^ 

NAOTRTLFGKTSYNRPTRFIREIDDELIQHQGLARPVNSSFGVKYSKEQPTQFGQGMSLQQ^ 
LQALNANNSHETSWEIGDVATHKK^GDGTVLEVSGSGKTQELKINFPGIGLKKLLASVAPISKKEN 
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SeqID 299 

MKLTVQLMVILTFSFAGEVLSTIFE^ 

FQDIKPYLPGIAIIILGALFIiNILTXGriVSQWIKKRYEGDYPEIGGKNGlSrFNE 
SeqID 3 00 

MIFVTVGTHEQQFNRLIKEVDRLKGTGAIDQEVFIQTGYSDFEPQNCQWSKFLSYDDMNSYMKEAEIVITHGGPATF 

MSVISLGKLPVWPRRKQFGEHINDHQIQFFKKIAH^^ 

GEI 

SeqID 3 01 

MVMKX I EIiKEATVQVSNGLAEMKTI LDHVNLS I YEHDFIT ILGGNGAGKS TL FNVIAGTLML SGGMI YIMGQDVTJKfL 

GGQRQALSLLMATLKKPNLLLDDEHT^^ 
VRDLNQAQKNKMAIADYYQLFD 

SeqID 3 02 

MEIKKKimiMLYSALILGTILVNNSYQAKAEELTKTTSTSQIOT 

S TATATATHSTMKQ WDNQTQNKELVKNGD FNQTNP VS GS WS HTSAREWS AWIDKENTADKS P I IQRTEQGQVS LS S 

DKGFRGAVTQK^IDPTKKYEVKFDIETSN^ 

TLELYYEKGTGSATFDNISMKAKGPIO}SEHPQPOT 

ATGSTTVKISDKSGKIIl^VPLSVTASTEDKFTKLL^ 

SNHTFLWKDLDNLISnSTSAQLTATYRRLEDIiAKQITNPHSTIYKNEKAI 

E I GVPRS I TATLALMNNYFTDAE I KTYTDP IEHFVPDAGYFRKTLDNPFKAIjGGNLVDMGRVKI I EGLLRKDNT HE 

KTSHSIjKNLFTTATKAEGFYADGS YIDHTJWAYTGAYGISTVIiIDGIjTQLLPI IQETDYKI SNQELDMVYKWINQS FLP 

LIVKGELIVnDMSRGRSISREAASSHAAAVEV^ 

MNKMLNDSTVATK^ 

HYSNHFWPTVNPY!K3W?TTEKDAK^ 

TAQKGWVILNDKIVFLGSNIKNT^ 

RNIGYIFFKNSTIDIERKEQTGTiraSINRTSKESITSIVSNPFITISQKJHDNKG^ 
ELLENSSKQQVIYDKNSQTWAVIKHDNQESL^ 

SeqID 303 

MHSFSNPGYPYDNAVTEAFFKYLKHRQINRKHYQNIKQVQLDCFEYIENFYN^NPHTANLGLTPNQKEENYFNAIK 
SeqID 3 04 

VKKTYGYIGSVAAILI^THIGSYQL^ 

DQGYVTSHGDHYHFYNGKVPYDAIISEELLjMTDPNYRFK^ 

I AEQVAKGTKEAKEKGIjAQVAHLS KEEVAAVNEAKRQGRYTTDDGYI F S PTD I IDDLGDAYLVPHGNHYHYI PKKDL 
S P S ELAAAQAYWS QKQGRGARPSDYRPTPAPGRRKAPI PDVTPNPGQGHQPDNGGYHPAP PRPNDAS QNKHQRDEFK 
GKTFKELLDQLHRLDLKYRH\TEEDGLIFEPTQVIKSNAFGYWPHGDHYHIIPRSQLSPLEMELADRYLAGQTED^ 
SGSEHSKPSDKEVTHTFLGHRIKAYGKGLDGKPYDTSDAYVFSKESIHSVDKSGVTAKHGDHFHYIGFGELEQYELD 
EVANWVKAKGQADEIjAAADDQEQGKEKPLFDTKK^ 

KDKKHYRYDIVDTGIEPRLAVDVSSLPMHAGNATYDTGSSFVI PHIDHIHWPYSWLTRDQIATVKYVMQHPEVRPD 
WSKPGHEESGSVIPIWTPLDKRAGMPJSnffQII^ 

ADGSSLRTINKSDLSQAEWQQAQELIiAKKNTGDATDTDKPKEKQQADKSNENQQPSEASKEEKESDDFIDSLPDYGL 
DRATLEDHINQLAQKANIDPKYLIFQPEGVQFYNKNGELVTYDIKTLQQINP 

SeqID 3 05 

MKKGFFLMAMW S L VMIAGCDKSANPKQPTQGMS WTS F YPMYAMTKEVS GDLNDVRMIQS GAGIHS FEPSVKDVAA 

IYDADLFVYHSHTLEAWARDLDPNLKK^ 

WIAKELGRLDPKHKDSYTKKAKAFKK^^ 

PSPRQLKEIQDFVKEYNVKTIFAEDl^ 

SeqID 3 06 

LRKKQKLPFDKLAIALMSTSILLNAQSDIKANTVTEDTPATEQTVETPQPTAVSE 

DDANDLAPQAPAKTADTPATS KATIRDLND PS QVKTLQEKAGKGAGTVVAVIDA S KE 

DIiEKAKKEHGITYGEWVNDKVAYYHDYSKDGKT^ 

WGLADYARX^AQAIRDAIISTLGAKVIIOMSFGNAAIA 

DHPDYGVVGTPAAADSTLTVASYSPDKQLTETATVK^ 

GKI ALI ERGD IDFKDKI AKAKKAGAVGVL I YDNQDKGFPI ELPNVDQMPAAF I S RKDGLIjLKDNPQKT I TFNATPKV 

LPTASGTKLSRFSSWGLTADGNIKPDIAAPGQDILSSVANNKYAKLSGTSMSAPLVAGIMGLLQKQYETQYPDMTPS 

ERLDL^KVLMSSATALYDEDEKAYFSPRQQ 

DKPQELYYQATVQTDKTOGKHFAIjAPKAIjYETSWQ 

QDPKKEELMSIPYIGFRGDFGNLSALEKPIYDSKDGSSYYHEANSDAKDQLD 
IKAVKEGVENIEDIESSEITETIFAGTFAKQDDDSHYYIHRHANG^^ 
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EVLDREGISTVVWTSEVTEQVVK^ 

VIVDNTTPEVATSATFSTEDRRLTLASKPKTSQPVYRERIAYTT^^ 

PLKMSDFTYVVEDMAGNITYTPVTKLTjEGHSNKPEQDGSDQAPDKKPEAKPEQDGSGQTPDKKTSTKPEKX)SSGQTP 
GKTPQKGQPSRTLEKRSSKRALATKAS 

SeqID 3 07 

FSVTYSQSERTWFSFGEITFSRSRWTNGFETRIPVDEWLGLE 

TIITKI)CVIjKAVKFVEKLLKEKERYRFYLEEPPERKKVKKIjYVEGDGVMIKSTDSREERRYL 

TKRYELQDKHEILQLNYDKAKYNLLDYIYWNYEVDDDTILITNSDMGKGYTSRVFKELG 

EKIiSSYLRKYPIELTDFALDAVTICi^^ 

GIJffiSQHRKITYRMKRRGMYWSKWGI STMANMI ILERANGIiRELFFGSWRKVYSEYKEGS FS AGRLFKKTDELDKFS 
KPLLKNGRKWSITGIKTK 

SeqID 3 08 

KASSGDYGTKREIITANKDKYSIS 
GITLSRRRIRRIMJO^LNLVSVYQIOVrF 

LYNREIIGLSLGWHKTAELVKQAIQSIPYALTKVKMFHSDRGKEFDNQLIDEILEAFGITRSLSQAGCPYDNAVAE^ 
T YRAFKIE FVYQETFQ S LEEIiALKTEKATLF CTT F I KC CLFRFLPML 

SeqID 3 09 

MKTRNRKGGYLAISITANEYIDSKQAIH^^ 
MVQFENLQACEVGFNVYFKADNLKEVK 

SeqID 310 

LSTFDSVTGRTimLEAYEGLNLSEKFAILSHFDKLRNELQTPSIQLGEFDREMEAFSITLGNELtiGYLEANGSPYEL 
KRELHQAEMMAVIjELSRQLVAKFSTKIjEEI,GIDLGSFQPD 

ELLQMGLSHDKVEFFIiTSQLQLEDMRQVAFAFLHESLTREEAEQFETDRFRHTSLNFREWREIiLEKQEP 
SPLWEVLHHYPLGSRVTYKGQEFEILSIEAADIVto^^ 

REEQPEEEWTLFSFMDEGTEDNEKEPEIVFESTDKVVTLDSQPAQVSETISESVPETKEVIEAEQAIAVDFSFPEDL 
TNFYPKTARDKWANLVAIRLVKEL^^ 

SDMKQSSLTAYYTDPLLIRQMWSKLEQDGFSGGKILDPSM 

NVHIEVKGFETVAFNDNSFDIiVI SITVPFAISIIRIADNQYDKPYMIHDYFVKKSIjDLVHDGGQVAI IS STGTMDKRTEN 
ILQDIRETTDFLGGVRLPDTAFB^IAGTSVTTDMLFFQKHLNKGYQADDIAFSGSVRYDKDERIWLNPYFDGEYNAQ 
VLGRYEIKNFNGGTLSVKETTDNLIASVREA^ 

KGSTIYYRDNKGIRVGTKTEEISYYVDDEGNFQAWESKHSQKQIDRFNNLEVTDSTALDVYWEEPAKRGQFKGYFK 
KAVFYEAPLSEKEVARIKGMVDIRN^^ 

DKYSLLAS LEDES IjD P SGKTVT YTKS IjAFEKAIjVRPEKMVKEVSTAIjDAIjKS S LADGRGVDFD YMAS I YQTAS KAAL 
IEELGDQI I PDPESYLKGQIaTYVSRQEFLSGDIVTKLEVMDLLLK 
GSRWIPLAVYGKFVQEAFMGKISrYDLTATEVEEVL^ 
LLNSNQPTITKQIQEGDKKK3nn?DVEKTO 

EIDGIiAQISriSLRPHQKNAIQRIVEEKRALLAHEVGSGKTLTILG 
FPTKKVYVTTKKDFAKARRKQFV^^^ 

AERSIKGLENQLEELQKLDRDTFIEFENLGIDFLFVDEAHHFK^ 

EYGDRNWFATGTPVSNSISEIiYTJVnynST^IQPDVLERYQVSNFDSWGAFGNIENSM 

PELMRIYKETADIQTSDMLDLPVPEATVIAV^ 

DI^LIDPAYTLSDNQKIMQVVDNVERIYREGKGDKAT^ 

HDANTDEKK3sISLSRK\/NSGEWILMASTEKGGTGLWQARMKATO 

YITKGSFDNYLWATQENKLRYIKQIMTSKDPVRSA 

FNRSKDEYRHTISYCEKNLPVMEKRLRQYDRDIEKSQ 

RT^TFRGFELKMATRSPGEPLSDIWSLT^ 

AVARVEVEK^FAKEEEYQLVKAKYDVLAPLVEREADLEEIDVALSQFSSSDPCLKKDQQLVLDI 
SeqID 311 

MTTKCITOHFLINQEKGEKYVFRKSKQYRTLCSVALG^^ 

P S P I AEQKD SLAATGQ S TGTVTVTVPHDKVTQAVDKAK^ QQLKKAEEDATNQTT 

TISKTVEIYKSDKATYEAEKKWVEKRNEELTAAYDKAEQTGTGLNHSVDTTVSELKSQDQN 

SVSGWQEYVKSVAAIDKKNKANLM 

AGQAAVEQANQEKQKQAANRAFEIATITKRNKEREEVAKKENAAIDAYNAKEW 

ALDIiNHGE PQVKHGAGTRNPDRI I S KGDAMLGGYSNILD STGF FVYNHFKTGETLNFT YQNLKHARFDGKKI TAITY 
D ITNLVS PTGTNAVQLVVPNDPTEGFIAYRI^GAGNWRTDKMEFRVKA^ 
IGLEYVKDSSGKFVPIHGSSVQOTNEGLARSLGSiSn^ 
MPTQVGGKTYWFALNTLPVAKTOT 

LTKVTLPTNLKLEPLPKAPQKPTVHYHDYLLTTTPAIAKEVMN^ 

S L I FED YLPAGYAFDMTKTQAENSD YDLTFDKNKNFVTLKAKDS LLQTLNKELNKS YQL S APKLYGS VQNDGATYSN 
SYKLLINKDTPm^WISNVVRIRTPGDGETTSRIRPKKDNEN 
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JODTIARGFLFTODYPEEALDLVDKGTVITT^ 
QAFYDTYVKTGQSLALM^ 

LVGSQNFYTLSWDLDQYRGLQADKSQIAQGFYFVTO 
QAAIAJO^ITPKGAFQVFMPKDPQAFYKAY 
VPTVHPHKSNTNKEGVS I EGKTVLPISTTVNYYKIVLDYS QYKI^ 
NGKFVRGLSWTYASLADAPKAVQEAMKSR^ 

QYENTAYQIDFGIAYVTETVVNNVPKLEPQKDVVIDLS QKDKS IiNGKAIALDQVFNYRIiVGS LI PRNRATALVE YS F 

KDDYDEKHDEYKGVYKAYTLRDVTLKDGTVIjK^^ 

MKRIASGDVENTVIHSVNGYNIRSNTVK^ 

LLLGIiAYGLSKRKKEKN 

SeqID 312 

MANNNF YBTRD P FGNMDD I FISTS LMCNMGG YNS ENKR't LINGRE\H? PE E F S Q YRQTGKL PGQELNNQNT PTNQVS AD S V 
LTl^GTNLTDQARQHLLDPVIGRI^ 

IS IDI S SLEAGTQYRGSFEENI QNIIKEVKETGNI HjFFDEIHQILGAGSTGGDSGSKGLADHiKPALSRGELTVIG 

ATTQDEYRNTILKNAAIjARRFNEVT^ 

AIDLIDMTAAHLAAQHPOTDLKSLEKEIAAQR^ 

AMS IERLTGVPVS1S!MGASD IERLKELGNRLKGKVI GQISnDAVEAVARAIRRIvIRAGFDDGlSrRPI GS FLFVGPTGVGKTE 
LAKQLAFDMFGSKDAI VRLDMSE YNDRTAVSKLI GATAGYVGYDDNSNTLTERIRRNPYS I VLLDE IEKAD PQVI TL 
LLQVLDDGRLTDGQGNTIWFKNTVI IATSNAGFGNBAFTGDSDKDLKIMERI S P YFRPEFLNRFNGVT E F S HL S KDD 
LNEIVDLMLDEVNQTIGKKGIDL\A7^ 
DGQIVISER 

SeqID 313 

MGRFKEnLESKKALILHGALGTELESRGCDVSGKLWSDKYLIEDPAAIQTIHEDYIRAGADIVTTSTYQATLQGIaAQ 
VGVSESQAEDLIRLTVQLAKAVREQVWKSLTKEEKSERIYPLISGDVGPYAAFLADGSEYTG 

RIELLLDEGVDLLALETIPNAQEAEALIEIjIiVEDFPQVEAYMS FTSQDGKTI SDGSAVAGLAKAIDVS PQWALGIN 

CSSPSLVADFLQAIAEQTDKPLVTYPNSGEIYDGASQSWQSSRDHSHTLLENTSDWQKLGAQWGGCCRTRPADIAD 
LSEHLT 

SeqID 314 

MKIGIDKIGFATSQYVLEMTDLAIARQVDPEKFSKGLLLDSL^ 

TESSI.DQSKAASVYVHQLLEIQPFARSFEMKEACYSATAALDYAKLIWEKHPDSIWLVIASDIAKYGIKSTGESTQG 
AGSIAMLiISQNPSILELKEDRLAQTRDIMDFWRPN 

SGAVAEIFTGQLVDGYQNKLQSDRI^QLNKRQKITVTEYEKLFFEKTILDENGNANFNTYRTGTFSLDSICEHQRIY 
KKINN 

SeqID 315 

MKSAYIFFNPKSGKDEQAIiAQEVKSYLIEHDFQDDYVRIITPSSTO 

GVYAGGAYPTIGL VPAGTVKNFSKALNI PQERNIiALENLLNGHVKS VD I CKVNDDYMI S S LTLGIiIiAD I AANVTS EM 

KKKLGPFAFVGDAYRILKRJSTRSYSITl^Y 

KXiLLHLRQFRKGDFSQAKEIKHFH^^ 

SeqID 316 

MTTFTAKFIDQEWEVPVESGRYHMIVGEFCPYAQRPQIARQLLGLDKHISISFVDDVPSDIGLIFSQPEQVTGAKSL 
RDIYHMDPTYKGPYTIPILIDKTD^^ 

LMYKLAFLDKQADYDTYSKEFFTFLDQKEm 

I FS WAKTLYNDFNLKTLTJSTPQS IKHDTYYLGKFGRAVRHHTIVPTGP]Sny[VKWEK 
SeqID 317 

MKKKX ILKSS VLGLVAGTS IMF S S AFADQVGVQVIGVND FHGALDNTGTAlvIMPDGBCVTNAGTAAQIjDAYIDDAQKD F 

KQTNPNGESIRVQAGDIWGASPANSGLLQDEPTWT^ 

TKSYPHEAAKQEIWAOTIDKVNKQIPYl^^ 

IVKYAKELQAKItt^IWLAHVP^ 

QG^YADVRGVLDTDTQDFIETPSAKWAV^ 

VS PVGSL ITEAQIjAIARKS WPD IDFAMTI^GGIRADLLI KPDGTITWGAAQAVQP FGHILQVVEITGRDLYKALNEQ 
YDQKQNFFLQIAGLRYTYTDNKEGGEETPFK^A^YKSNGEEINPDAKYKLVIOT)FLFGGGDGFASFRNAKLLGAIN 
PDTEVFr^YITDLEKAGKKVSVPWNKPKIYVTMKiy^^ 

STKINPVTTIHKKQIiHQFTAINPMRNYGKPSNSTTVKSKQLPKTNSEYGQSFLM 
SeqID 318 

MAYIWSYLKRYPITOLWL^^ 
SRLTTTMIRDMRNDIV^^ 

LAWLVAVAMPLLVGVVLYVAIKTKPLSERQQTMLDKINQYVRENLTGLRVVRAFAR 
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LFKLTGLTEPLFVQIIIAMIVAIWFAL^ 

VMDMPISINPNAEGVTDTKLKGHLEFDIOTTFAYP^^ 

VTLGKILVDGVDVRDYNLKSLRQKIGFIPQKAL^ 

HLAEGGSNLSGGQKQRLS IARAVVKDPDLYI FDDS FSALDYKTDATLRARLKEVTGDSTVIjXVAQRVGTIiyiDADQI I 
VIiDEGEI VGRGTHAQIiIENNAI YRE IAES QLKNQNLSEGE 

SeqID 319 

MKKGQVTSTDTKQSYSLRKYKFGLASVIL 
QASDKWNSQNTATKDITTPLVETKPMTO 

KWI S YK8F CGVRRYAAIESLDPS GGSETKAPTPVTRFS GSI^QEKIATQGHYTFSHKVEVKINEAICVAS PTQFTLDKGD 
RIFYDQILTIEGNQWLSYKSFNGWRFVLLGKASSVEI<:TEDKEICVSPQPQARITKTGRLTISNETTTGFDIIjITWIK 
DDNGIAAVKVPWTEQGGQDDIKWYTAVTTGDG^ 

NS SQEPIEHGLAKTGVYNI IGSTEVKNEAKI S SQTQFTLEKGDKIlSIYDQVIiTADGYQWI S YKS YSGVRRYIPVKKLT 
TS S EKAKDEATKPTS YPWLPKTGTYTFTKTVDVKS QPKVS S PVEFNFQKGEKIHYDQVL WDGHQWI S YKS YSGIRR 
YIEI 

SeqID 320 

MENWKFALS S ILGHKMRAFLTMLGI I IGVAS VVTiIMALGKGMKDSVTNEITKSQKNLQIYYKTKEDQKNEDNFGAQG 
AFMQGSDTJraKEPIIQESWLKKIAKEVDGVSGYYVTN^^ 

EED YNQFS RVI LLEEKLAQRLFQTNEAALNKVVTVK PEAGS GL YGSNSDGNAILTNTQIiAS E FGA 

KE AENI YFHLNDVS Q SNRI GKE I GKRLTD I S HAKDGY YDNFDMT S I VKS INT QVGI MTGVI GAI AAI S LL VGGI GVM 
NIMLVSOTERTREIGLRKALGATRRKILAQFLIESMVM^ 
IFSAFIGIIFGLLPANKASKLNPIDALRYE 

SeqID 321 

MQYSE IMIRYGEL STKKKNRMRF INKLKNNMEHVLS I YPDVSVKTDRDRGHVYLNGTD YHEVAESLKE I FGI QAF S P 
SFKVEKNVDTI>V]OWQEIMTSWK^ 

RDEAAYI S YENIRGAGGLP VGTS GKGMLMLSGGID S PVAGYLALKRGVD IEAVHFAS PPYTS PGALKKAHDIjTRKLT 
KFGGNI QFI EVPFTEIQEE I KEKAPEAYLMTLTRRFMMRI TDRIRENRNGLVI INGES LGQVAS QTLESMQAINAVT 
ATPIIRPVVTMDKLEIIDIAQKIDTFDISIQPFEDCCTIFAPDRPKTNPK^ 
TIQPQADSDDVDDLIDDLL 

SeqID 322 

MARVKGGWSRKRRKRVLKLAKGYYGAKH 

QLMHGL KLAE IE VKRKMIiADLAVNDAAAFTALAD AAKAKLGK 
SeqID 323 

MFKASKKLVQKNKSNHFWLVFFITLILFLIGCYASLRFGAINFKTSDLITVLKNPLKNSNAQDVIFDIRLPRIIAAI 

LVGAAMSQAGAIMQGVTRNAIADPGLLGINAGAGLALWAYAFIiGSMHYSTIIilVCLLG 

YHQLRLILAGAMISTLFTSVGQWTLYFKLNRTVIGWQAGGLSQINWmLIIIAPIIILGLL 

ESVAKALGQKTQLMTAFLLLIVL^ 

RIINPPSETPISSIISIVGLPCFLWLIRKGKNL 

SeqID 324 

MIKRPIHLSHDFLAEVIDKEAITLDATMGNG^TW 
NLEQYVHTPLRAAI FNLGYLPSADKTVITKPHT^ 
QNNFATMLYQPLNQVNTPPFLIMVEKLQSYENEV 

SeqID 3 25 

MRVHITSIYGQSPRSIALISQKLVKDVGRQLGYDEMGIYFYlSrDHAETHGERSTRMD 

TEFDELFLDKLQAYGARIITFVHDIVPLMFESNFYLLD 

DHPVNIDLPRPECQK^LSFAGDIQRFPFViroWKENIPLI^^ 

WSEDREErjVERRYSRMWASYKLSTFLAAGIjPIlANHDISSRDFIKQHG 

VEKIATLLRNGYITKKLLiIDAVHMLYR 

SeqID 3 26 

MSQKTFGKQLTVVDTKSRVKMHKSGKIS^^ 

GALVGGASLTSRVYADETPWQEQS S S VPTLiAEQTEVTVKTTTVQNHQDGTVS KNI IDSNS VSMS ES AS TSTS ES VS 

MSMSGSTLTSVSESVSTSALTSASESISTSASESVSKSTSISEVSNILETQASLTDKGRESFSAJSTQIVTESSLVTDA 

GKNASVSSLIEITKPKSELQTSKMSNESLITPEKSQVMIASDKTGNESLTPTIRLKSVIQPRSMNLM 

LEEVSDTEMLGKDVSSELQKTTOIAIjKI)NT 

LIWQDIMDETGQL^TGSYSPLTHN^ 

ERVQYNSPSVTEHTJT^TNVRSRIMKL^ 

KILKTNSKNLTDSIVnDQNYDSPEFEDV^ 

YGSKYSYGHTNISGDSDANAEIKLLSESASTSASTSASTSASMSASTSASTSASMSASTSASTSASMSASTSASTSA 
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STSTSTSASTSASTSASTSASMSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSAS 
TSASTSASMSASMSASTSASMSASTSASTSASTSASTSASTSASTSASMSASTSASTSASTSASMSASTSASTSAST 
SASTSASTSTSTSASTSASTSASTSASMSASTSASTSPSTSASTSASTSASTSASTSASTSASMSASTSASTSASMS 
ASTSASTSASTSASTSASMSASTSASTSASTSASMSASTSASTSASTSASTSASTSTSTSASTSASTSASTSASMSA 
STSASTSASTSASMSASTSASISASTSASMSASTSASTSASTSASTSASMSASTSASTSASTSASTSASMSASTSAS 
TSASTSASTSASTSASTSASTSASTSSSTSASTSASTSASTSASMSASTSASTSASMSASTSASTSASTSASMSAST 
SSSTSASMSASTSASMSASTSASTSASTSASMSASTSSSTSASMSASTSASMSASTSASTSASTSASMSASTSASMS 
ATTSASTSVSTSASTSASTSASTSSSSSWSNSSKEIO^SALPSTGDQDYSVTATALGLGLMTGATLIiGRKKSKKIDK 
D 

SeqID 327 

MIDRKDTNRFKLVSKYSPSGDQPQAIETLVDNIEGGEKAQILKGATGTGKTYTMSQVIAQWKPTLVIAmKTIAGQ 
LYGEFKEFFPDNAVE YFVS YYDYYQPEAYVP S SDTYI EKDS SVNDEIDKLRHS ATS SLLERNDVIWASVS CI YGIjG 
SPKEYADSWSLRPGQEISRDQLLNNLVDIQFER]^^ 

lESLTGRVLGEVEHLAI FPATHFMTNDEHMEEAI SKI QAEMENQVELFEKEGKLIEAQRIRQRTEYBIEMLREMGYT 

NGVENYSRHMDGRSEGEPPFTLLDFFPEDFLIMro 

EFESHVHQIVYVSATPGDYEMEQTDTVVEQIIRPTGLLDPE\^^ 

AEDLTDYLKEMGVKVKYMHSDIKTLERTEIIROT 

IQTIGRAARNSNGHVIMYADKITDSMQRAI^ 

DYSSLSKKERQAEIKALQQQMQEAAELLDFEIiAAQIRDVILEIiKAID 
SeqID 32 8 

MKHKLKAFTLALLSIFFVFGGKVSAETVNIVSDTAY^ 

VX^VQSGQADALMAGTTVTEARKKVFNFSDTYYDT^ 

GYKVKTFDTSDLMNNSLDSGSIYAAMDDQPW 

MKSDGTYlSroiiy©KWLGKDATKTSGKATGNANEKATPWPSYKIVSDSS^^ 

FKLD I SNPGFDAALNAVQS GQADGVI AGATITEARQKI FDFSDPYYT S S VI LAVKKGS3STVKS YQDLKGKTVGAKNGT 
AS YTWL SDHADKYNYHVKAFDEAS TMYD SMNS GS IDALMDDE AVLAYAINQGRKFETP I KGEKSGD I GFAVKKGANP 
ELIffl^FNNGIiASLKKSGEYDKLVK^ 

VIGIIFGMMSVSPSNTLRTISMIFVDIVRGIPLMIVAAFIFW 

EIWGGIEAVPSGQMEASRSLGISYGKTMQKVILPQAVRLMLPNFINQFVISLKDTTIVSAIGLVELFQTGKIIIAR 
NYQ S FRMYAILAIMYLVI I TLLTRL AKRLEKRLK 

SeqID 32 9 

MGDKPISFRDKDGNFVSAADVWNAEKLEELFNTLNPNRKLRLEREKIaAKEKG 
SeqID 33 0 

METYTLSNTLNIPKIGFGTWQLTEGEEAYKAVT^ 

DKHDYHLAKASIDESLQKIiGVDYIDLLM 

ALFETAEIKPMVNQIIItkPG^QEDLV^ 

FLPLPKSATPKNIEANLDIFDFQLNEDDITTLIQLDSGIKPKDPDNVSF 

SeqID 331 
MVKGQGNWAQNKKA^^ 

PDRTRKLLLKKREIEKISNELKGTGMTLVPLKOT^ 
R 

SeqID 332 

MDHFTKLWQDFSKLPlSrv^ 
LITOGTDIDILYRNIDNFLSDLEDWEH^ 
NWLLTGKLPSYDKQIIKALKRQDFVSra 
KLCHHNSNEHTVKETLNDIIMHLDViyiLKENFQHFIG 

SeqID 333 

MKKI I YLGLACVS ILTLSGCES IERSLKGDRYVDQKLAENS SKEATEQLNKKTKQAIjKADKKAFPQLDKAVAKNEAQ 

VLIKTSKGDINIKLFPKYAPLAVENFLTHAKE 

FVNEISPYLYNIRGSLAMANAGADTNGSQFFINQSQQDHSK^ 

I^TVDKIASVEVTKSDQPKEKITITSIKVIKDYKFK 

SeqID 3 34 

MKIGIIAAMEEELKLLVENLEDKSQETVLSI^^ 

GAVATGIiNV'GDVVVADTLVYHDVDLTAFGYD YGQMSMQPLYFHSDKTFVS T FEAVLSKEEMTS KVGL I ATGDS F I AG 
QEKIDVIKGHFPQVLAVEMEGAAIAQAAQATGKPFWVRAM^ 



SeqID 335 
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MEKRLS LGALVLASTVLIjAACGNVGGGAS stgtki gkdikvgynwelsgnvs s ygnsmkngadlavke inaaggvgg 

KKLKVL SQDNKS ENAE AATVATNLVTKGANVI IGPATSGAAAS STPKVNAAAVPMIAPAATQDNIiVYGSDGKTLNQY 

FPRATFVDNYQGKLLS QYATDNLKAKKVVLFYDNS SDYSKGVAKS FKE S YSGKIVDSMTFSAGDTDFQAS LTKLKGK 

EYDAIVMPGYYTETGLIWQARDLGISKPVIiGPDGFDSPKFVQSA^^ 

YGEEPSMFSALSYDAWMAAKSAKGAKTSIDLK^ 

SAK 

SeqID 33 6 

MENHNS IKQT YGLMTT I AMI VG WI GS G I YFKVDD I LKFTGGD VFLGMVIL VLG S F S I VFGS L S I S ELAI RT S E S GG 
IFSYYEKYVSPAIiAATLGLFASFLYLPTLTAIVSW^ 

SLTTFVKMI PLVLIALIGAFWSDKAPQLPQHLTAIQPSISTVGWSWVSGLVPLYFAYDGWTI FVS IAPEVKNPKKNIjPLi 
AFVI GPALI LLS YIjAFF YGLTQXLGAS F IMTTGNDAINYAANI I FGP S VGRLLS F I VILS VLGVANGLLLGTMRIjPQ 
AFAERGWIKSERMANINLKYQMSLPASLTVTAVAI FWLFVHFMVTKFNLLPGSDI SEI AVVFITOTSLI ILYVXjVXiSIi 
YLKKDIKNKFTGLVSPIIAILGGLILFIGSLLSNFFTVLIFQCFCLLFCLICHYIYQKl^NPKTHE 

SeqID 33 7 

LTEFITODQHSNHDQKSFKEQILAELEEANRL^^ 

TKQRLEEQEKARIEAKLIiAEAAREEERRQAEQAIjASQEEQVINQGM^ 
DVATAVPNQETEEIFLVRATDIPTEGEITVKLGETSELEPV^ 

AKRIARIL I S 1 1 1 LVLLLTAFVGYRFVDSAI KPVDSNSNKFVQVEI PI GSGNKLI GQI LEKAGVIKSAT VFNY YSKF 

KJTCSNFQSGYYNLKKSMTLDQIAAELEK^ 

KLVQDETFIKKIWAKYPNLLGSL^ 

SVISTOVLTLSSLVEKEGSTDEDRRK^ 

TGIiMPGPVDS PS I SAI EATI KPAS TDYL YFVAD VKTGNVYYAKD FETHKANVEKYINS Q IN 
SeqID 33 8 

MSRMIPGRIRNQGIELYEQGLVSLISQEGNLLKA^ 

EGKAILSKVQVQQESQQETKKKTSFGSVFLDSLIINEDDTIKYQLSAQGEQNPYANDIVmTLKIRRIiPDDRSWI^ 

IKAFLNTVRKEAYYQIGKQYFETLSLIQFDETSQELIEFLWRLIPSHSSKIDLEFILPNQGRHIiSLTRGFFEEGVTL 

MKTALENFSFESDFHQFNHLYFKELEGEDH^ 

TAIRSLPIEGDLAKHIHFDLDDQDKLAAHLL^ 

VH3SIRQELEQLTFASHFKHEEKIFKLLERYGFAPHFSTSHPAYSAQELYDFYTYMLPQFKKMGTVSLSAKLESYRLIE 
RPQIDIELAKGSLLDISFDFSDLLENDVDQAL^ 

DNI AAFQLS ELFA2STQDNVS FS QHFYQL IEDLRHPEKFKI PGhSVSAS LRDYQIiTGVRWL SMLDHYGFAGILADDMGL 

GKTLQTISFLSTKLTRDSRVLILSPSSLIYNWQDEFHKFAPDVDVAVAYGSKIRRDEIIAERHQVIITSYSSFRQDF 

ETYSEGISTYDYLILDEAQVMKNAQTKIAHSLRSFEV^ 

QVARYIKPFVMRRRKEEVLPELPDIilEMNYP 

ICDTPRLFMDYDGESGKLESLHQLLTQIKENGHRALIF 

FHAGS KDAFLI SLKAGGVGLNLTGADTVVIIjIDLWWNPAVEMQAI S RAHRLGQKENVEVYRLITRGTI EEKILEMQET 
KKHLVTTVLDGNETHASMSVDDIREILGVSK 

SeqID 33 9 

VLLTEIKKSPEGLYFDKKIDIKESLMERHSEIMD^^ 

QTINEVFIEAENVSTKKELVDQDLVLILEEDDI^ 

EKQAKEKEKSNPFAALEGMFDSD 

SeqID 34 0 

MEVKTAIEWMHTFJtfQKIQSNKDYL^ 

S GPLYGSAFMGITKAEQSESTI SEALGAGIiEMI QKRGKAELNEKTMVDVWHGVIEAIEKKEIiTEDRIDS LVDATKGM 
KATKGRASYVGERSLGHIDPGSFSSGLLFKAIiLEVGGV 

SeqID 341 

MQDKLMIRGARAHNLKNISVDIPRDKLW^ 

DGLSPAISIDQKTTSIOSrPRSTVGTVTEi:i^^ 

IIRRKKGQHKSTFEKIQKDGYWWIDGDIHDVTEVPEL^ 

YVVIDTI^GNELLFSEHYSCPECGFTVPELEPRLFSFNAPFGSCT 

NPISSISTCYPTMLEQAMTQFDVD^TPFEK^ 

ETNSDYTRlSrraREYl^ELKCOTCHGYRLWDQALCVRVGGE 

KE I HDRLS FLNNVGLNYLNLSRSAGTLS GGE S QRIRLATQI GSNLSGVIiYVIiDEPS I GLHQRDJSTDRLIDSIiKKMRDLt 
GNTLI VVEHDEDTMMAADWLIDVGPGAGAFGGE IVAS GT PKQVAKNTKS I TGQYLSGKKVT PVP S ERRVGNGRFLE I 
KGAAElSnsiLQNLDVKFPLGKFIAVTGVSG 

GRTPRSNPATYTGVFDDIRDLFAQTNEAKIRGYKKGRFSFJWKGGRGESCSGDGIIKIEI^FLPDVYVPCEVCHGTR 
YNS ETLEVHYKEKI^IAQILDMTVNDAVTFFAAI PKIARKLQTIKDVGLGYVTLGQPATTLS GGEAQRMKLAS ELHKR 
STGKSLYILDEPTTGLHADDIARLLItt7I^ 
AENPKSYTGYYLKEKLAR 



WO 2004/099242 PCT/EP2004/004856 

87/109 



SeqID 3 42 

MWPEDRIASFRRTLLGWYDQEKRDLPWRRTTNPYYIWVSEIML 

KAWEGLGYYSRVRJSTC^QKAAQQVMVDFGGI FPHTYDDIASLKGIGPYTAGAIAS IS FNLPEPAVBGNVMRVMARIjFEV 
NYDIGDPKNRKIFQAIMEILIDPDRPGDFNQAIiMDiLGTD 

KPMRIQAFVIRNQNGQYLLEKKTKGRLLGGFWSFPIIETSPLSQQLDLFDDNQSNPIIWQTQNETFQREYQLKPQWT 
DNHFPNIKHTFSHQKWTIELIEG\^VKATDLPNAPHLKWAAIEDFSLYPFATPQKKMLETYLKQKNA 

SeqID 343 

LIEITWTVKYITEFIATAFLI ILGNGAVAKTVDIjKGTKGNHSGWI I IAIGYGLGVMMPAIiMFGNVSGl^HINPAFTIiGIi 
AF S GL F PWAHVGQ Y I LAQ ILGAMFGQIiVVVMVYQP Y FVKTENPjBTHVLGS F S T I SALDD GQKS S RKAAY IM3FLNE FV 
GS FVLFFGALALTKNYFGVELVGKLVQAGYDQTTAATRI S PYVTGSLAVAHLGIGFLVMTLVAS LGGPTGPALNPAR 
DLGPRIVHRLLPKQILGQAKEDSKWWYAWVPVIjAPIVASIIiAVALFKIjIiYL 

SeqID 344 

MSSYWISrfcTYPELjKKNIDETNQL^ 

EILHVATLIHDDVIDDSPLRRGISnyiTIQSK^ 

MHLRYNQQQGIHHYLRAISGKTAELFK^ 

VLEDLTQGVYS LPIiLLAIEENPD I FKP I LDKKTDMATEDMEKIAYLVVSHRG VDKARHLARKFTEKAI SD INKL PQN 
SAKKQIiLQLTNYLLKRiCI 

SeqID 345 

MFKIPLFKELKTDQWVKPFFKQYKVSLVIALFL 
GRPVFRYIERLTSHNWVLRMTSQLRLKLYHSLESN^ 

S FI 1 I ATGFFSLWFALMMLLYLAIMIFLFPLWS IIiANGARQTREKELKNHLYTDLTDNVLGI SDWI FSQRGQEYVAL 
HERSESELMAIQKKIRSFJSnSTRRAIiIVELVFGFI^ 

QETNKYSDSIHRLNELSETYFETTQNQLPNKPYDFSVKNLSFQYKPQEKWVLHHLDLDIKEGEKIAILGRSGSGKST 
LAS LIjRGDLKASQGKI TLGGADVS I VGD C I SNY IGVI QQAPYLFNTTLLNNI RIGNQDAS EEDVWKVLERVGLKEMV 
TDLSDGkYTMVDEAGLRFSGGERHRIAI^ILLKD^ 
GIEHADRILFIENGQLELEGSPQELSQSSQRYRQLKAADDGDL 

SeqID 3 46 

MKKLITEKKVNNVSTVNYLKLGLVSAMFAGGAFVALGSTQGV 

YI QGSIiS GI LNQHS TL YKQQNAAVTPQVS S PKAERQLTDS EI YERAQKQVLPKYIQGSL S GILNQHS TLNA 
SeqID 3 47 

MBENMNI KQLKSKTMLGTVALVSAFS FAS TNADANTYJSfYAVDVDYLASAEE I AQAHPASNT FPLGQCTWGVKEMATW 
AGNWWGNGGDWAAS AAS AG YTVGTQPRVGS I VCWTDGS YGHVAYVTAVDPVTNKIQVLE SNYAGHQWIDNYRGWFDP 
QNTATPGWSYIYPN 

SeqID 3 48 

MKINQMKKDEL FEGF YLI KKAEVRKTRAGKDF I AFTFRDDTGE I SGIMWDAQT YNVT2EFVAGKI VHMKGRREVYNGT 

PQWQITIjRHIKDGEPNDPRDFKEKPPIISrVDNVREYMEQMIiFKIENATW 

ESGLAYHTATiyTTOIxADSIGDIYPEI/NK^ 

NIDDTKEEVTVLRHVILSHHGQLEYGSPWPRIMEAEIIH^ 
KPNIK 

SeqID 349 

MLVDKKWRFEDSASYFACPKCQNPLIKESNSLK 

NHILEAISK^LENNSQFHSVLDIGCGEGFYSRQI^^ 

DSSIDIILDIFSPANYKEFRRVLSDDGILVKVVPV^^ 

QSYNCSQQERQAFIDMTPLLFSVDKTTIDWASISEITVGALIVIGKKRSVSK 
SeqID 3 50 

MCLICQRIEMIERJSTENPYFVKEYETGYLVLGDHQYFQGYCLFL^ 
RADKmiESLGNGDAHLHFHLFPRKTGDLRlSr^GHNGKGPWWYPFE 

SeqID 3 51 

MFSDLRKKFVFLTMS ILI VWLFLFAVSIffiY^ 
NTDLTNDVIEKSSLKLLEQGKKSRK^^ 

YLSRFIVGPVETEMTREKQFVSDASHELKTPIAAIRAWQVLEQQIPGNRYLDHWSETKRMEFLIE 

KRSKVNFKKLNIiSVLCQEV^ 

QARRKAILTISNPSAIYSKEVMDNLFERFYQ 

DGFWTNTMIN 
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SeqID 3 52 

LIILDKKSYDLLFYLLKLEEPETVMAIANA^ 
LDEVTDYSYVMKSSERLQLSLVSIVVAKDR 

DGHPLSXIQYLYKLLDDIYHNGSSSFIDLFNHKLSQAPGASTYFSKEVI^yFHHYLPISQRSLGKKXNSQDGQFMIQ 
I LPFILMAYRKMRLS PEVQTSLNSDFSLVWQRKE YE IAKELADELEENFQLSIiDE I EVGLVAMLMLS FRKDRDNHLE 
S QD YDDMRATLTS FLKELEERYHLHFVHKKDLLRQLLTHCKALLYRKRYGI FS WPLTEHIKDKYEELFAITS S SVK 
LLEKAWQIKLTDDDVAYLTIHLGGELRHSQQSPNKIaKLVIVSDEGIAIQKIiLLKQCQRYLTNSDIEAVFTTEQYQSV 
SDLMHVDMVVST SDALE SRFPMLWHPVLTDDD I IRIiIRFS KKGNCANSNQFTNELEKT I AQYVKED SERYVLKS KI 
EKLIHQELLQDVLPLQSTVC 

SeqID 353 

MKYFLDTADVSEIRRLNRLGIVDGVTTNPTI I SREGRDFKEVINEI CQIVDGPVSAEVTGLTCDEMVTEAREI AKWS 
PKfWVKI PMTEEGLAAVSQLSKEGIKT3STVTL I FTVAQGL S AMKAGATF I S PFVGRLEDI GTDAYALIRDLRHI IDF Y 
GFQS EI IAAS IRGIiAIH/EGVAKCGAHI ATI PDKTFAS IiFTHPLTDKGI ETFLKDWDS FKKK 

SeqID 3 54 

MSHYSIKLQEVFRLAQFQAARYESHYLESWHLLIAMV^^ 
NYQFLEQSPALKKILKDAENISIVVGAEDVGTEHVLIiA^^ 
AGFTKDDIKAIYEIjRNPKKAKSGASFSD^KPPSTAG 

lSTNPVLVGDAGVGKTALAYGIiAQRIAHGNI PYELRDMRVLELDMMS WAGTRFRGDFEERMNQI I ADIEEDGHI ILFI 
DELHTIMGSGSGIDSTLDAANILKPALARGTLRTVG 

KPAYEAFHNVT I SDE AVMTAVKVAHRYLT SKNLPD SAIDLLDEASATVQMMI KKNAPS LLTEVDQAI LDDDMKS ASK 
ALKASYKGKKRKPIAVTEDHIMATLSRLSGIPVEKLTQADSKKYLNLEKELHKRVIGQDDAVTA 

TGKRP I GS FMFLGPTGVGKTELAKAliAEVLFDDE S AIiIRFDMS E YMEKFAAS HLNGAPPG YVGYDEGGELTEKVRNK 

PYS VLLFDEVEKAHPD I F1WLLQVLDDGVLTD SRGRKVDF SNTI IIMT SNLGATALRDDKTVGFGAKDI S HD YTAMQ 

KRIMEELKKAYRPEFINRIDEKWFHSLSQDNMREW^ 

P LRRT I QTQVEDHLS ELLLANQVKEGQVI KI GVSRGKLKFDI AKS 

SeqID 355 

MVLDKEIKAQLAQYLDIjLESDIVLQADL 

sglpmgheftsfilallqvsgrapkotediikrikgiektinletw 

ggmyqdevkskgimsvptwkdqeeftsgratieqlleqldgpldaeafadkgwdvlvigggpagnsaaiyaarkg 

lktgijaaetfggqviewgienmigtlytegpkli^^ 

tai^lgakwrniwpgeeefrnkgotycphcdgplf^^ 

qvlqeraaktdnltilk^watkdivgedhot 

wdkfgstni pgi faagdctdaaykqi 1 1 smgsgataai gafd ylirq 

SeqID 356 

MIKYQDDFYQAWGEWAKTAVIPDDKPRTGGFSDLADDIEALMLS 

REEVGVSPVLPLiIEEYKGLQSFSEFASK^AEYELEGKPNEFPFGVAPDF^AQLNVLWAEAPGIILPDTTYYSEDNE 

KGKELLAFWRKSQEDLLPLFGLSEQEIKDILDKVL^^ 

IFQKILGQKPDI<VIVPEERFWTEFASDYYSESX^/tfELLK^ 

KKAAYYLAS GPYNQALGLWYAGEKFS PEAKADVEHKIATMIDVYKSRLEKADWLAQS TREKAIMKLNVITPHI GYPE 

KLPETYTKKIIDPKLSLVENATNLDKISIAYGWSK™^ 

EQSSSANYGGIGAVIAHEISHAFDTNGASFDEHGSLN^ 

NVADLGGVACALEAAQRE SD F SARD FF INF ATI WRMKARDEYMQMLASVDVHAPAQWRTNI TVTNFEE FHKE FDVKD 
GDNMWRP VEKRVI IW 

SeqID 3 57 

MSKILFFGEPLIRITPKE1STOYFADSISTKLFYGGSEVN 

SIQWVGERVGLYFLEDSFACRKGEVVYDRDHSSLHDFRINQIDFDQLFEGVSLFHFSGITLSLDESIQEITIiLLLKE 
AKKKEITISLDLNFRSKLISPK^AKILFSQFATFADICFGIEPLMVDSQDTTFFNRDEATIEDVKERMISLINH 
QVI FHTKRIiQDE WGRNHYQAYIANRKQEFVTS KEI TTAVNQRIGS GDAFVAGALYQLLQHSDS KTVID FAVASAS LK 
CALEGD1SIMFETVTAVNKVLNQSKDI IR 

SeqID 3 58 

MKSRKKDKLVIiRLTTTLLVFGLGGWFYNY 

ES S S GQSDLSKAPNYI^FGIKGE YKGKS VQMPTLEDDGKGNT^QIQAPFRAYPNYS AS L YDYAELVS S QK YAS VWKS 
NTS S YKDATAALTGLYATDTAYASKLNQI IETYSLDAYDK 

SeqID 3 59 

MADVKVVNNEDSRGQKQDLKAKLFHIKIGSVPLPVYVCLALLILLAGF 
PGFKNFGGPAILSLLVPSILVFFNLINKNVLEST^ 
GMVCAMMVGTFVGVILGLEWRHTLFYIVTP 
ALLHRLGEKKPHLSGQGQLWLNKGEDMSDIIAD 
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AI LKYINVI PRETQNGAKQLYKF I SGNFTFPLMAGLGLLYIPLKDWATLS I QYFIWI SWFTVI S VGFFVSRFLN 
MNPVEAGIISACQSGMGGTGDVAILSTADRMNIjMPFAQVATRIaGGAITVITMTAILRMLF 

SeqID 3 60 

RGKKAGKYTTSDGYI FDAKDIKKDTGTGYVI PHMTHEHWVPKKDLSESELKAAQEFLSGKSEANQDKPKTGKTAQE I 
YEAIEPKAXVRPEDLLFGIAQATDYKNGTFVIPHK^ 

KRPKVEGWGKDAE I YKEKD SNKADKP S PAPTDNKSTSNS SDKNLSAAEVFKQAKPEKI VPLDKI AAHMAYAVGFEDD 
QLIVPHHDHYHETVPMAWFDKGGLWKAPEGY 

EEPEDSGKVTHirYGFYDVNKGSDEEEPEKQEDESELDEYELGMAQ^ 
IHGSQVQVTKKDGSKVLVDIKTLTEVK 

SeqID 361 

MNRKKTVI I SALSVALFGTGVGAYQLGS YNAQKSDNS VS YVKTDKSD SKAQATAVNKTPDQI S KEEGIS AEQI VVKI 
TDDGYVTSHGDHYHYYNGKVPYDAII S EELIMKDPS YVFHKADVINEVKDGYI I KVNGKYYLYLKEGSKRTNVRTKE 
QI QKQREEWSKGGSKGESGKHS S AKTQALSASVREAKASGRYTTDDGYVFS PTDVIDDMGDAFLVPHGDHFHYI PKA 
DIiS P S EL S AAQAYWNRKTGRSGNS SKPSNS S S YI HASAPSGNVSTGRHANAPI S I PRVTHANHWS KPAGNHAT APKH 
HAPTTKPINKD S ALDKMLKRLYAQPLYARHVE SDGLVYDPAQVMAFTAI GVS I PHGNHFHF IH YKDMS PLELE ATRM 
VAEHRGHHIDALGKKDSTEKPKHISHEPNKEPOT 
FLMPKILKKIQVQVMS FHI 

SeqID 362 

MAKFGFLSVLEEELDKHLQYDFAMDWDKKISffi 

DAEDYLVTIPYEPKKGLSREFLAYFAETmEVATEGLSDLMDFLTDDSIEEFGLSWDTDAFENGRAELKETEFYPYP 
RY 

SeqID 3 63 

MNTKQRFSIRKYKLGAVSVLLGTLFFLGGITl^^ 

NLVTKANNS PTETQPVAESHSQATETFS PVANQPVESTQEVSKTPLTKQNLAVKSTPAI SKETPQNIDSNKIITVPK 
VT^TGYKGEGTWAIIDSGLDIiraDALQL^ 

VKS T S HGMHVT S I ATANP SKKDTNEL I YGVAPEAQVMFMRVF SDEKRGTGPALYVKAI EDAVKLGAD S INLSLGGAN 
GS LVNADDRLI KALEMARLAGVS VVI AAGNDGTFGS GAS KPS ALYPDYGLVGS PSTAREAI S VAS YlvTNTTLVNKVFN 
1 1 GLENNRNLEntf GLAAYADPKVSDKT FEV FTKKVVHAINHGA 
VGAIIFNNKAGEAOTaTMSLDPEASAIPAIFTQKEFGDVLAKM 

KPDLSAPGGS I YAAIWDNE YDMMSGTSMAS PHVAGATALVKQYLLKEHPELKKGDIERTVKYLLMS TAKAHLNKDTG 
AYT S PRQQGAGI IDVAAAVQTGLYLTGGEJSTNYGS VTLGNI KDKI SFDVTVHNINKVAKDLHYTT YLNTDQVKDGFVT 
LAPQQLGTFTGKTIRIEPGQTQTITIDIDVSKYHDMLKKVMPNGYFLEGYVRFTDPVDGGEVLSIPYVGFKGEFQNL 
EVLEKSIYKLVANKEKGFYFQPKQTNEVPGSEDYTALMTTSSEPIYSTDGTSPIQLKALGSYKSIDGKWILQLDQKG 
QPHLAISPJSTDDQNQDAVAVKGVFLRNFWJSnLRAKOT 

GTTTDGIPLEDGKYKYVLTYYSDVPGSKPQQMVFDITLDRQAPTLTTATYDKDRRIFKARPAVEHGESGIFREQVCY 
LKKDKDGHYNS VLRQQGEDGI LVEDNKVFI KQEKNGSF I LPKEV1STDF SHVYYTVF1D YAGNLVS AKLEDL INIGNKNG 
LVITVKVFSPELNSNVDIDFSYSVKDDKGNIIKKQHHGKDLNLLKLPFGT^ 

D S LKI)VLFKVNLLKKAALLVE FDKLLPKGATVQLVTKTNT WDLPKATYS PTDYGKNI PVGDYRLNVTL P SGYS TLE 
NLDDLLVS YKEDQVNLTKLTLINKAPL INALAEQTD I ITQPVF YNAGTHLKKTNYLANLEKAQTL IKNRVEQTS IDNA 
IAALRESRQALNGKETDTSLLAKAILAETEIKG]^ 

I AKNQLNGHETDYSGLHHMI I KANVLKQTS S KYQNAS QFAKENYHNL IKKAELLL SNRQATQAQVEELLNQ I KATEQ 
ELDGRDRVS SAENYS QS LNDNDSLNTTP INPPNQPQAL I FKKGMTKE S EVAQKRVLGVT S QTDNQKVKTNKLPKTGE 
STPKITYTILLFSLSMLGLATIKLKSIKRE 

SeqID 3 64 

MISIKNEKKVKYFLRKTAYGIjASMSAAFAVCSGIVHADTSSGI SAS IPHKKQVNLGAVTLKNLI SKYRGNDKAIAILLS 

RVITOFNRASQDTLPQLINSTEAEIRNILYQGQ 

VKSISSSKDSQLILKFVTQATQLMSTAESTKAKQMAQNDV^ 

NSXraETLVNQANGKKQEIAKLENLSI^^ 

KKIERI SSKGLALSKKAKEIYEKHKSILPTPGYYADSVGT 

KELQDLTRGTKEDKKPDVKPEAKPEAKPNIQVPKQAPTEA^ 

AVQKAVDQAYDHVEEGKFITTDQANQLANKL 

AKPDVKPEAKPD VKPEAKPEAKPEAKSEAKPEAKLEAKPEAKPATKKS VNT S GNLAAKKAI ENKKYS KKLPSTGEAA 
SPLLAIVSLIVMLSAGLITIVLKHKKN 

SeqID 3 65 

MAKDIKFSADARSAMVRGVDILADTVKVT^ 

SKTND I AGDGTTTATVLTQAI VREGLK3STVTAGA3^I GIRRGIETAVS AAVEELKE I AQPVS GKEAI AQVAAVS S RS E 

KVGEYISEAMGRVGJSTDGVITIEESRGMETELEVVEGMQFDRGYLSQYM^ 

LPLLEEVLKTEHRPLLI IADDVDGEALPTLVLNKIR 

DATMQVLGQS AKVTVDKD STVT VEGAGD S S AI ANRVAI IKS QMEATT SDFDREKLQERLAKLAGGVAVI KVGAATET 
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ELKEMKLRIEDALNATRAAVEEGIVSGGGT^ 

IERLKQSEIGTGFNAANGEWVDMVT 

F 

SeqID 3 66 

WENLEKPIGVSYKNSPSMSKRTAIRMKKSSRFSILL^ 

SDVFATGGFLYYVTIALSYLLGSSIWLLIVQFIAYYVSGIYFYKLVYYVAQSEIVSIGMTLIFYIKN 

IQWAIiFFMLISLWFLIKFCVDNIVDEAFIFYGII^ 

FGMILIAYTVGYFMFNLQIISSYIDKAIFYPFTYFART^ 

LNFIACIFIIYAIMAIFSRDFNLYHFLPALPFGLLLTSNKIT^ 

LLLVSLSIGLLVYOTYQl^ 

I LKDELL QHGAKYF I LNKNE KL PNELKSD I KKH YQE VPL SNI THFVL YRFK 
SeqID 3 67 

MAKPTT S PGMQQYLDIKEISTY'PDAFLLFRMGDFYELFYDDAVKAAQILEISLTSRNKNAEKPI PMAGVPYHSAQQYID 
VLVEBGYKVAIAEQMEDPKKAVGVV^ 

LLTDFESWSEILNLKAREIWGYQLTDEKNHLLTKQMNLLLSYEDERLTOIHLID 

QKRELSHLQK^M^YEIKDYLQMSYATKITSLDL^ 

RQDIIQVFiQDYFFERiroLTESLKGVYDIERIiASRVSFGK^ 

DTMPELESLINTAIAPEAQATITEGNI IKSGFDKQLDNYRTVMREGTGWIADIEAKERAASGIGTIjKIDYNKKDGYY 
FHVTNSNL SLVPEHFFRKATLKNS ERYGTAEIiAKIEGEMLEAREQS SNLE YDI FMRVRAQVE S Y I KRLQELAKT I AT 
VDVLQSLAVVAE!NYHYVRPKFlSrDQHQ I KI KNGRHATVEKVMGVQEYX PNS I YFDS QTD I QL ITGPNMSGKSTYMRQL 
ALTVIMAQMGGFVS ADEVDLPVFDAI FTRIGAADDLI S GQSTFMVEMMEANQAVKRASDKS L I LFDELGRGTAT YDG 
MAIAQSIIEYIHDRVRAKTMFATHYHEM^ 

PIDLLDRATDILSQLEADAVQLIVSPSQEAVTADLNEELDSEKQQGQLSLFEEPSNAGRVIEELEAIDIMISrLTPMQA 
MNAIFDLKKLL 

SeqID 3 68 

MLKLDLKTKIKEAIIilAFGVALYT^ 

KSLLLTIYGTVLMSFFMWFWQQIPVTVPLKNDMML^ 

QTLLFIDAIVLTSSLVYINLQQMLYTLVASFWSQV^ 

QGAYSGREHDVLYVALNPSETODVKEIMADLDPDAFISVIlSrroEVISSDFKIRRRNYDK 
SeqID 3 69 

MKRSMYAGRVRSEHIGTSITLKGWVGRRRDLGGLIFIDLI 
VTAREQAKTONLPTGEVELKVQELSVLOT 

LDNLEFIDVETPMLTKSTPEGARDYLVPSRVNQGHFYALPQSPQITKQLLMNAGFDRYYQIVKCFRDEDLRGDRQPE 
FTQVDLETSFLSDQEIQDIVEGMIAKVMKDTKGLEVSLPFPRMAYDDAMNOT 

KVF S EAS VVKAI VVKDKADKYSRKNIDKLTEIAKQYGAKGLAWLKYVDNTI S GPVAKFLTAI EGRLTEALQLENNDIi 
ILFVADSLEVANETLGALRTRIAKELELIDYSKFNF 

KOT^VAYDIVLNGYELGGGSLRINQKDTQERMFKALGFSAESAQEQFGFLLEAMDYGFPPH 
KIJNIREVIAFPKMNKASDPMTQAPSLVSEQQLEEIiSIjTVESYEN 

SeqID 37 0 

KEKGKLIKKKILENNTDIIPKIIEK^ 
ELFDKARELGYFVPALYNSYAMAFRKIKOT^^ 

SeqID 3 71 

LNVKKHHLAYGAI TLVALF S C ILAVMVI FKS SQVTTE SLSKADKVRVAKKSK1OTKATSKSK?TODVKQAPKPS QASNE 
APKS S S QS TEANS QQQVTAS EEAAVEQAWTENTPAT SQAQQAYAVTETT YRPAQHQT SGQVLSNGNTAGAI GS AAA 
AQMAAATGVPQSTWEHI I ARE SNGNPWVANASGAS GLFQTMPGWGSTATVQDQVNS AI KAYRAQGL S AWGY 

SeqID 3 72 

MSNWDTKPLKKGFTFDDVLLI PAESH^ 

MS IVDQAEEVRKVKRSENGVI IDPFFLTPDNTVSEAEELMQNYRISGVPIVETLENRKLVGIITNRDMRFI SDYKQL 
I S EHMT S QNLVTAP IGTDLETAERI LHEHRI EKLPLVDDEGRLS GLITIKD I EKVI EFPKAAKDE FGRLLVAGAVGV 
TSDTFERAEALFEAGADAIVIDTAHGHSAGVLRKIAEIRAHFPN^^ 

I C T TR WAGVGVP Q I TAI YDAAAVARE YGKT 1 1 ADGGI KYS GD I VKAIiAAGGNAVMLG S MFAGTDEAPGETE X FQGR 
KFKTYRGMGS I AAMKKGS SDRYFQGS VNEANKLVPEGIEGRVAYKGSVADI VFQMLGGIRS GMGYVGAANIKELHDN 
AQFVEMS GAGLKE S HPHDVQITNEAPNYS VH 

SeqID 3 73 

MTINHYKLRI PYYTI SFLLPFI IIVCILFTKNI YWGS PTTIIiASDGFHQYVIFNQALRNILHGSNSLFYTFTSGLGD 

NFYALSSYYLGSFLSPIVYFFNLKDIMPDAIYLLTI^ 

IEINMWLDVFILIPLWLGVDQ 
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FTFLSLLAGMTS S IMI LPTY FDLTTHGEKLTKVSKMFTENS WYMDLFiUCNMIGAYDTTKFGSIPMIYVGIaLPLLIiSL 
LYFTIKETV"PRRTRIjAYGFLIIFVI 

GVNLILVSGFIITFLFKKHYHYLNLELLLLTLTFLSAYIILTISFVSKQIPKLVFYPFLIGFVVLEMTLNTFYQIiNS 

LNDEWIFPSRQGYAKYI^SISKLWKTERNN^ 

SDGTNLNLRYQMrTLIADSLFGVKY^ 

QTTLLNQLSGLK^TYFEHLIPNSVSGQTTLNK^^ 

S VNNGPFVPNTLDNAYS FFNI GS FAENSRIKVKFQFPHNDQVS FPI PHFYGLKLEAYQKAMTVINKR.KVKVRTDHNK 
VI ANYTS PNRS SL FFTI PYDRGWKAYQNNKE IKI FKAQKGFMKINI PKGKGKVTLI FI PYGFKFGVGLS ITGI VLFT 
VYYFKFGKNKIG 

SeqID 3 74 
WFMKHSVGKRHNLVT 

SeqID 3 75 

MKDHLRCFLHYRSKNVGKNGTRLKGGKPND 
SeqID 3 76 

DKQLFQCVQGFQIRNLNGKKHGIKLIFIKNVKH 
SeqID 3 77 

NEEAIDGVSKLGKKITILGNNSYLSS 
SeqID 3 78 

IKQWLIiVMKKMTVLCIiKLLEILLLCKTAILN 
SeqID 3 79 

RRI SNNLGVLKEATVIHAV 
SeqID 3 80 

ELCLKLSNVKPIYMPWHFEQISIiKKQVSRVYHALFE 
SeqID 3 81 

RS CFSRC YPRFNRFRYHFLKPPWTRTNYCKRY 
SeqID 3 82 

RWFGFRYDDGRKFS FTNLL 
SeqID 3 83 

PLKSLSMTSRMKLLRLSQRKAM 
SeqID 3 84 

RHLRKRQKQNHLMNLCFLCAVKKLKKDNLI 
SeqID 3 85 

KLFIQPIQENMKQPIEIISNYYAKLGLMKI 
SeqID 3 86 

YQTYSKKARKFGSVWLKPK 
SeqID 387 

TLLFQILLLLSHLKKlSriFLEWNMHATYN 
SeqID 388 

KKFYEIILATKYHTSVFKWSLKLKNKL^ 
SeqID 3 89 

LKQMFRRVLHLLFQWLIATOSLMKMVLKSSWDGWCSLSLLMRIIQ 
PLILEKYLKQITALWAAEYLKVILFLKKPRDSF 

SeqID 39 0 
GRKINFSRPSRFE 



SeqID 3 91 

RVKYKKWEPLSISRSCYEKSSSTSSRI 
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SeqID 3 92 

TWSRCCIPRCYSWCKIRRSNQCFGRWTSSFG 
SeqID 3 93 

KENF VQRLRS LLKAI S KNI 1 I YSLVS LPLARHL 
SeqID 3 94 

SIINIKLEIYDI CPKTI SNRESIHGYRKLLPRLLFGDYLSKTLFQGFTSNHPNCRPIRL 

SeqID 3 95 
LSFRRSRRIGSSHSQI 

SeqID 3 96 

FI FKDRFKMRHFTD 

SeqID 3 97 

GLICACKAFISAILANLFCSSVIPIASCPENICAKPRTRFSSTSD 
SeqID 3 98 

S S S LDTH YKHREL FLVADS LGS LWL ILLVAYFV 
SeqID 3 99 

HLVPTQYHKQNLPLEYKRLNALVHK 
SeqID 4 00 

AHLHEGYHKYPLKRRNLYEFLRKL 
SeqID 4 01 

PLKELREH Y I I PRKI RLQNFPHR 
SeqID 4 02 

KYTIAGSRVTQRRPSFSVTKV 
SeqID 4 03 

AAFSSPAAKNRAFITTRSS 
SeqID 4 04 

SMVKQISFSDHIiISRFEGLCLRRFSVRFRIiRRSLSSVSNSTNNTSWL 
SeqID 405 

YHVLSRYHLNPLHVXiQALYAHQEYVIVLIIHQRELFYRGQRGQ 
SeqID 4 06 

AQPSSTRISPKIDRFTPK 
SeqID 4 07 

WNRTIWYFLNNLQLVTNI KLPTRKI FQS QNKFL S F IKN 

SeqID 4 08 
VQEGHKI FPLQR 

SeqID 4 09 

QQSNNTANYSRPYKPWHKKKNGLSCRCFS 
SeqID 410 

FFDDTKVNIHWLEMFAVCMCNIMWQCTNSSSL^^ 
CSKGLV 

SeqID 411 

VYSIjRI ATLELERNLIDKRRGKPTDSRS IEFLEVSLAILSCKKTKIiI PDIMLEKTFPLVS SEAKSEDLKFKS AI SLNL 
ENCS 



SeqID 412 

GHSNGIKEGNGGIFPVFCIAITATRFRFTINGFCNRINPCLCHFIVTLLSCCNRKS 
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SeqID 413 

RIEKAIVPKPAI SVLMIISRMVXjGTPEKEPVICSANGIISXVIATPADXEIQMSKADIIKAKPVEKLFCENSLTELT 

KAAPGTITISAPNIIMLRCSPNPEKYNKLAL^^ 

KEKLPAPQK 

SeqID 414 

P YSVT YAAPVI CNGADIiARRTAYAMS S S VTKP SDVEELVT IGDRTKRFSRLNPLGS INGRNKLPWTLFI S AHRPLI S 
HRLSLKSIKLCN 

SeqID 415 

ANHNDKNNTISVNKFSAK 
SeqID 416 

SFCVFCCHSKKGSYPHPKQSSWSSRFNSRSHTNDITCPYRCSKSSCKGSIT 
SeqID 417 

TLGI S FKIRGCHFSAKPFLGGKS PDFPLPG 
SeqID 418 

FNISNSSLGFGTSAKSASTSSANSSLIISCDKATHSLQMKTPIiPAIIFFTSSWLLPQNEQ 
SeqID 419 

GVYKKQSLSLIFHQIHLIVYHFDFLHDRFVPYYRE 
SeqID 42 0 

IFLRSVCGNSRQALGLTSYILHKLSSPKN^^ 
QIjAHAVQSIS 

SeqID 421 

SRISTITIPIRISPTFLYREDFSSGLKGSFGLRLSSVKYFFPSKVYWRRPKNIPIAASEKPRWKFTSCPRYPTTMGE 
I KAPRLI PKTKI LKPASRRES F S P YKLPT I SDT FGFNS PVPRI IRAI EMNRAV 

SeqID 422 

SFPRSNNLCAFKDKFKCRTSSRLDCL 
SeqID 423 

IHSFHRRKLNHHGKLNLYHLVLEKQPGLiSSLLIMLYQSPDL 
SeqID 424 

LSFSIVSHLLTTIIIPLPASCAWPAIRLSYSRIPSSALITIRTTSERSMARIERITEYFSVFSYTLPDLRIPAVSII 
VYSCPWLSTKLVSIASRVVPATGLAITRSSPRMALIKLDFPTFGRPIKLKRIISGFSFSSSTGRFSTIASNISPVPI 
P 

SeqID 425 

CPSSDIAGINFTKNSGKIDKQTVT 
SeqID 42 6 

PFLRNHFHNNETSTKFFSDSTKSRIGNTCHRC 
SeqID 427 

ASTGSAANPQTTPGKLLQASIAPIPVPQPVITKSAAPLLSKIAVSKPFCT 
SeqID 42 8 

AMPKRCFIPPEYDEIFFLPTSPKPTSSSNLGISCLAFFLSTPFKVAI 
SeqID 429 

LLLILKNPDTTHNTTIKRQPNKPPKVLRKRTEKKSIKLMKRIPINHVAP 
SeqID 43 0 

NFKARPPGWSGLPNITPTFSLEJWLIKIAIVFVLLMDADNLRIA 
SeqID 431 

KRVGICHIYHVSQSSCFHNNKFSRYFTQFSFNIIVHSIFLSQYFSFW 



SeqID 432 
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NSTTSASLISGLISSRSGRRV^PSSFSAS1TOTKAGRPKA7ASTASRIAGTFIDFSRTEITCPGVTRYDGISTRLPST 
RI 

SeqID 433 

TRPLVEILVDKLAPYKEANFSRNS CTLGYiQAFAVTNI 
SeqID 434 

RTRNAIRYPKASSGDYGTKREIITANKDKYSISKMCR 

KI KI CLNNEGITL S RRRIRRIMKRLNLVS VYQKATFKPHSRGKNEAP I PNHLDRQFKQERPLQALVTDLT YVRVGNR 
WAYVCLIIDLYNREIIGLSLGWHKTAELVK^^ 

PYDNAVAESTYRAFKIEFWQETFQSLEEIiALKTEKATLFCTTFIKCCIjFRFLPML 
SeqID 43 5 

atggatgtat cgtctagtcc gaatattaca tttatgctgc aatacacaga ggctaatcct 60 

caatatgtgg actatactaa cagagaagag gctgtcaaaa ttgatgaaga attgtcctta 120 

gaaacgaaca ggcaaatgat tgaaggatta actgaagacg agttgactcg tattcaggaa 18 0 

gctgtccctg aaacgcagtt gaattttagg gaatacattg attatatgaa ccgctcgtat 240 

gcaactgaag aacaatctaa agaactaaca gctatcttta ctcaagaagc agattatctt 3 00 

cagaaactac gattaatcga tctaaaaaat aagttggaat cagcttatca aaatggttca 3 60 

cttctctggc aaggagttat ttcgtttgat aatgcttttc ttgcggaaca gggattgtat 420 

gatgttgcga ctggtcaagt tgatcaaaaa gcgattaagg cagtgatgcg tgatatgatg 480 

ccaacactta tccagaaaga gggcctttct gattctgctt tttggtgggg gaatatccat 540 

ctgaatacag ataatatcca tatccatttt gggctttctg aagttgaatc taaccgtgag 600 

aaaatattct atcagccacg tggacgtatg gagtacaaag gtaacttctc tcagaaaacc 6 60 

atcaaccggt ttaaaagtgg tgtgtatcat ggattgctga aagaagaaac aagafcccaat 720 

cttctcagaa aagaacagat tttagctaat ctcaaagcgg acttcataac atctatttac 78 0 

cagaaggaca agattacttc ttcagctgaa aaaaattttt tggaacaagc ctacaatcat 840 

ttgccgctaa ataagaagtg gcgctatggt tctaatgcca gagattttgc ggttagtaag 90 0 

ttctttcttg atcgctattt agattcctat ttaaacaatg aaggtagtgc tgcctatcaa 960 

gaatttttga aagagactag agattttctt cagacttatg aaggggttta ttcagctgaa 1020 

aaaaataaaa tctatgaaaa actacgtaaa gttgatgggc aaacgatcag aacgcttgca 108 0 

gaatcaaaag gatatgattt agaacatcat ttggcacgtc gtgtaatgga tttaagagag 1140 

cgtttagcca ataatatctt acgttcgttt agagaagctg caccccaaat tcaagacgtt 1200 

cagctggaaa aaaatttaga gagtttttct gttttgaacc agaagaaaat tttagaacaa 126 0 

catcctgaag caagtgtggt aaaaagtcag aaagcttggc agaagttagg ctactttgtg 132 0 

aaggctggag agcagccact tgaaattata aggccagtct ataaatctta tgataagcat 13 8 0 

ggtaaaggta taggacggcc agaatttgta tcagatactg tttatgatat tagtcagcta 144 0 

acagaaaata ttcagctgaa aagtctaacc ttgaaagacc tctctctgtt ttcttctaac 150 0 

gagttaaaag agttagtaga tgctgctaag ttaaagacta atccaacaga gagagaacgc 156 0 

cgtgaattag gtacctatcg ttatgcgttg aaacttagca tattagaatc tagtcagaag 1620 

gaattgcagg ttcgtcaaaa gctactagaa caggtacagc cactagcgtc tgatcaacca 168 0 

tttttagatt ttaagaaaca gttaatagct caggaattac aagctatagc gctacaattg 174 0 

actcctaatt acaagctatc agaagatgat aaagccttga aaaatcgatt gaagaggcag 180 0 

tttgaagata gtgttgcgct acctgtttca aaagctactc ctggtgccat acaacttcct 1860 

attaggcaac tttggactga gctaggattg gttcatcaca ttcaagatga aaacattcta 192 0 

acgcttctga aagggacatc aacgacgaaa caagcttata tagaagaact tcagactcat 198 0 

atctctattt ttcagttgaa atatcagatt aacaatagaa acaagcagat aagccagtta 204 0 

tcggatgaag caacaataaa ggagatgagg atagctaatg ctaaaggttt ctctgagcta 210 0 

aaacgtctat atgatacatt acagccatca gatgatggtc aaaatcagat tagtcaagct 216 0 

gtttctaaac aattacaaga acgaaaagtt atcaaaaaag ctcaattaca acagacgcag 222 0 

agaagcggaa aaatcaatac agacttcatg cgacaattga cagcttctct taatcgttca 228 0 

caacaagcaa gtaaaaaagc attgatggaa cgtgcacgta gtgatgaacg tgaggaacaa 234 0 

gaagaacgta ggcaagctca acgttaa 23 67 

SeqID 43 6 

atgaattcta acacaaaagg tcacggattt ttccgcaagt caaaagcata cggcttagta 60 

tgtgctattg cattagcagg tgcatttaca ttagctacta gtcaagtgtc tgctgatcaa 12 0 

gttacaactc aagcaacaac tcaaacagta acgcaaaatc aagcagaaac agtaacatca 18 0 

actcaacttg ataaagcagt agctacagct aaaaaagcag ctgtagctgt tacaaccaca 24 0 

cctgcagtta atcatgcgac aactactgat gcacaagctg atttagctaa tcaaacacaa 3 00 

gctgttaaag atgttactgc aaaagcacaa gctaatacac aagctattaa agatgctact 360 

gctgaaaatg caaaaattga tgctgaaaac aaagcagagg cagagcgtgt tgcaaaagaa 42 0 

aacaaggaag gtcaagcagc cgtagatgca cgtaacaaag caggtcaagc agccgtagat 48 0 

gcacgtaata aagcgaaaca gcaagcgcaa gacgatcaaa aagcaaaaat tgatgctgaa 54 0 
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aacaaagcag agtctcaacg tgtaagtcag ttaaatgcac aaaataaagc aaaaattgac 6 00 

gcagaaaata aagatgcgca agctaaagca aatgcgacta atgcacaatt acaaaaagat 660 

tatcaagcta aattagcgga aatcaaatca gttgaagctt ataatgcagg tgtacgtcaa 720 

cgtaataaag atgcacaagc taaagcagat gcgactaacg cacagttaca aaaagactat 78 0 

caagctaaat tagcacttta taatcaagct ctaaaagcta aagcagaagc agataaacag 840 

tctattaata atgttgcttt tgatatcaaa gctcaagcta aaggtgttga taacgctgaa 9 00 

tatggaaact caatcatgac tgcaaaaact aaacctgacg gaagtttcga gtttaaccat 9 60 

gatatgatcg atggtgtgaa gacaatcggc tatgggaaat tgacaggtaa agttaatcat 1020 

cattatgttg ctaacaagga tggctctgtg acagcatttg ttgattctgt cactctttac 1080 

aagtacgagt atcgtaatgt tgctcaaaat gctgctgtta accaaaatat tgtatttaga 1140 

gttttaacaa aagatggtcg tcctattttt gaaaaagctc ataatggtaa caaaactttt 12 00 

gcagaaactt taaacaaaac tttacaactc aatcttaaat atgagcttaa accacatgct 12 60 

tccagcggta acgtcgaagt ctttaagatt catgatgact gggtacatga cacacatggg 13 20 

tctgctttag tgtcttatgt taataataat gatgctgttc ctaatgtggt catcccagaa 13 80 

cggccaactc caccaaagcc agtgaaagtt acacctgaag cagaaaaacc agtacctgaa 1440 

aagccagttg agcctaaatt ggtaacgcct acattaaaaa cttatactcc agtcaaattt 1500 

attccgcgag aatacaaacc agaaccaatc acccctgaga cgtttacccc tgagaaattt 1560 

actccagctc aaccaaaagt gaaaccacat gtgtctattc ctgaaaagat taactactca 162 0 

gttagtgttc atcctgtttt agttccagct gctaatcctt caaaagctgt cattgatgaa 168 0 

gcaggtcaat ctgttaatgg taaaaccgta ttaccaaatg caacattaga ctatgttgct 174 0 

aaacaaaact ttagtcaata caaaggtatt aaagcttctg cagaagcgat cgcaaaaggt 180 0 

tttgcatttg tagatcaacc aaatgaagcg ttagctgaat tgactgttaa gtctatcaaa 1860 

gcatctaatg gtgatgatgt atcaagcttg ttagaaatgc gtcatgtttt atcaaaagat 192 0 

actttagacc aaaaacttca atctcttatt aaagaggcag gaattagtcc agttggtgag 198 0 

ttttacatgt ggactgcaaa agatccacaa gctttttata aagcttatgt tcaaaaagga 2 04 0 

ctagatatca cttataatct atcctttaaa atcaaagcta actttactaa aggtcaaatc 2100 

aaaaatggtg ttgcacagat tgattttggg aatggatata caggtaatat tgtagtcaat 2160 

gatgttactg ttccagaagt acataaggat atacttgata aagaggatgg taaatcaatt 2220 

aataatagta cggttaagtt aggtgatgaa gtgacctaca aacttgaagg atgggttgta 228 0 

ccagcaaacc gtggttacga tctttttgaa tacaaatttg tggatcaatt acaacacaca 234 0 

catgatcttt acttacgtga taaagtggtc gctaaagttg atgtgacatt aaaagatggt 240 0 

acagtcatta aaaaagggac taatttagga gagtacacag aaacagttta caacaaaacg 2460 

acaggtcatt atgagcttgc ctttaaaaaa gagtttttag ctaaggtttc tcgtgaatca 252 0 

gaatttggtg cagatgattt tattgtagtt aaacgtatta aggcaggtga tgtttacaat 258 0 

actgctgatt tatatgttaa tggatataaa gttaagtcag aagcagttgt gactcatact 264 0 

actgagaaat caaaaccagt tgaaccacaa aaagcaactc caaaagctcc agctaaagga 270 0 

ctgccatcaa ctggtgaagc tagtatgacg ccacttactg caattggagc aattatctta 2760 

tcagctctag gcctcgcagg ctttaaaaag cgtcaaaaat aa 2802 

SeqID 437 

atgtttagtt ggttggaagc tctttattac actctgatac aactggcaaa agtgaaccgg 6 0 

ttgaatgctc ttttcttagt tagcgttgtg ggttatcttt gttaccaggg aataaaactc 120 

gtcagaaaaa ccataagaaa cttttttcag ctgatgaagg gtttcatagg tgatagagag 18 0 

aacatcaaga aatgcatcaa aaacaagaaa gaggcactag tccattcttg gaaacatcgt 24 0 

caagatattg attggaaatc aactggaaaa gataagagta aacagttatg gaatcttatg 3 00 

aagcgacttg cgacagttgc tccttcattt ttgttcttat tattgggaaa tgttctcttt 360 

cgtctcattt atcaactgcc ttttgtaaag caagacagaa agcgatttga caaggaaatg 42 0 

aagcccttgc tctacttcaa gaactatcgt agttttgtgt tcatgggaat aggtttcagt 48 0 

ttcatagcgt ttattctcac aaactatttt gtgacggttt taagggctgc tattcgtttt 54 0 

ctatatttct caatcatgac gttaagagat aatagccaag tcgttagctt taacgttgat 600 

agtttgctca tccagaattt attcaatgct agggtatttg tgatagctcc cattctagca 66 0 

gtgccaatct ttctcattgg tttagtcgta gcttggcggt ctgcttgggt taactttgaa 72 0 

cagtaccgtg attataatca taatgaagaa ggggatgatc gctttgcgac tgtcaaagaa 780 

atccaccagc aatataagaa agttcctaat aaaacggaaa cttatccagg tgaaggaggt 84 0 

gtgcccgttc ttcatgaaac aagaaagaat ttgacaggct taacgcttaa atctcaaatg 90 0 

ctttggcaaa accgtacctt tagtcgctat ttaacgaatg cggaaaggat tttagggctc 960 

ttatcgacgc cttcaggaga ttattacatc gatgatagca ccacaaactt gatcaccatg 102 0 

gggattactc ggtcaggtaa gggagaagct catattgccc ctattattga tattaatagc 108 0 

cgtgcggaaa ttcaaccgtc actgattatt gcagacccta aaggggaaca ttaccagtcc 114 0 

tcttataaaa ccatgcgtcg gcgtggctat gatgttaatg tcctttcttt ccaaaacatg 1200 

gattggtcca tgtcctataa ccctttagct cttgcgattg cagcagctaa gaagggttac 126 0 

tatgaaatga cacagacaag ggttaatgcg gttgcagaag ccatttatcg taaaacgaaa 132 0 

cctggtagcg gtaatggcaa tgcaaaatac tgggaagata cctccatttc cctctttaat 13 8 0 

gccattgcta tggccttaat ggaccgtgct aatgaaaccg tcaggaatgg tgaaaccgat 144 0 

gcttgggata ccgttacagt tcgtaacatt gccaagtttt tgactgactt gggttctgaa 150 0 
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gaagtctttg tcaatgattt tggagagatt gttgagaatc ctgataagaa ccaacaagtg 156 0 

aagaagaaat ctaaaatcac ggtttacttt gataacttgc gtaaaatcaa tcaagaacaa 162 0 

ttttccaaat ttagagatat ggctgattta aactttaggt cttctgactt cgcttcagaa 168 0 

gaaactaagg gaaatgtctt ctctagcatg atgtcaggta ttaacttatt cttgcaagat 1740 

aatattgcta aactaacctc taaaaactct attgacctag aatcggttgg tttcccacgt 18 0 0 

cgcttgtcta tcaagtttcg ttctagttcc aatgtcgcta tgcgtaacga atacactcat 186 0 

aagacggcta aggttaccat tactagtcaa gctgtttggg gtaaaaccac taaacaagtt 192 0 

atccacgtag atgctgcaac agctctgatt gatggtgaag gctatctaac ctatgtgatt 198 0 

gaaccccagc ttcctgatca attcttggta acaattgact ttaatcacga aaacaatggt 2 04 0 

ggttcagcta ttcgtcacaa aactttccaa ttctcagctg agaaagtcta taagaaacgt 2100 

ggtaacgtta ttacgttgga tgactacacg aaaaaaccag ttttggatca tatcaaagtt 216 0 

actgttctca acaagcaaga tgataacctt ctccagaaag aagatattga cctgatttat 222 0 

tcagataatc ctaaagtgat ttacttggta acacctccaa ataggactga atataatagt 22 8 0 

attgtatctc tgtttttgga tcaattgttt aatgccaatt atgagttagc tctgtcaaat 234 0 

ggtcgcaagt gtgtcaatcg aattcttcat atcctcgatg aattcacaaa cattccagct 24 0 0 

attcctcaca tggataccaa gatttccatt ggtcttggtc aaaatattct ctactatctc 2460 

tggattcaga acttgaaaca gttagtcagt gaatatggcg agaatacagc ggaaaccatt 2520 

cgggagaact gttctttgaa agtttatatc aaatctactg ccccagcgac caacgagtac 25 80 

ttcagtaaag agttggggac tcggaccatt acacgtcgca gaaggtcaag taatattcta 2 640 

gatgaagcta atccaaatgt ttccattgaa aatcctagac aggaactctt aacaccgaca 2 700 

cagctctcga aactccaaga aggggaagcg gttattttgc gtggtgttaa aggtcgagac 2760 

aatgcaggtc ggaaaatcac aacggatccg attttcttgc atgagaaaac gagccttcct 2 820 

tatcgctaca tgttcttaca agaagaattt gaccagtcga tggctttggc agatattcca 28 8 0 

gtggaaagtg ggcataggga ccttgacctg caagatatag cagtaggggc acaaagcact 2 940 

tttaataaga ttattgattg gcggatggct ctaactgacc gtatgagaac aaatgggaag 3 0 00 

atacctcaat tagcatcaag aaaacaaacg attaaagctc taagtcaatc tcaatttact 3 060 

tctccagcag acctaacaca agctgtgatt gcagaggtat ttgatgagga agatgatgac 3120 

gatcttttct ttgtggatga tgtcatgtaa 3150 

SeqID 43 8 

atgacagata atcgctttgc ccaattaaaa gaaaactttg aaaagggatc tcctaaaagg 60 

cgagttccaa cgtctcgccc aatcgcagct caaaaagcgc ctgagagcta taacaaaaag 120 

ggacggtatc cattttcgct ccaccaagat gtgcgttatg ataaattaga agcattagta 18 0 

gcttatcatg gagctaagtc tgcatcagat tatctggaga ggttgattgt tcaggaatgg 240 

gaaaagatgc agcggaagct taagaacaaa gaaaaataa 279 

SeqID 439 

atggcatatt tatcaaaatt atccgatttg gatccatcgt tgatggatgc ggattctgaa 6 0 

caaatttata ttcctaaagt cttgtttgag cataatgact ttaaaggctt gacctacaaa 12 0 

gagattttat tatattcttt tttgttaaat cgtttaagag agccattaga ttttattcaa 18 0 

aaaggctatg atgataatga agatacctat gttcacttta aggtcgaaga tttatgtgaa 24 0 

ctactcaacc agagtaagac aaccgttatt tccttgaaga aaaggttagc tcaatatggt 300 

ttgatagaag aagtgaaagc aggtagtcac cagccgaatc gtatttattt aacagataaa 3 60 

ttagttccat atafctaaggg gtaa 384 

SeqID 440 

atgacaatct ttgatgaaag agagttaaaa gaacgattta ctcatgaaaa tagggttagt 60 

ttttatgagt ttgfcggctaa atatgatgct caaatggttc ctgtgatgaa agcaaagggc 12 0 

tatcgatgta ttcattcaat ggagcgtaca gtggttttta ottttgggga attcacgatt 18 0 

aggcgacgtc ggtggcaaaa aggagaacat tgggtggtgc cagttgatga aaagctagga 240 

ctgaaaaaga atgttcgcta ctctttagaa ttcafcgtatc aaattgctag tttagcaacc 300 

atgatgcctt atgaaaaagt gattaaggtt gttcagatga tgtattgtat tgtgattacc 36 0 

aaacctactg ttgtgaaggc cgttaagatt agtcgtgaat tgcttaaaga aaaggaagcg 42 0 

tatcgttttt ttgatgaaga tataccagta gataaagaac cagttgatat gatttatctt 48 0 

gaaggagatg gagtcatggt caaagctcga gaagaaggat tagataatcg caatgttgat 54 0 

ttgtctcatt tcgtggttca tacgggtagt cagaaagtag gaagcaatcg ctttgtctta 600 

caaaataaaa aagagtttgt gtcccttgat aatcgtcaaa cgcgtcaaaa gattttggac 66 0 

tacctttaca atcattttta cattgcccca aacaccttgc ttattaccaa ttcagatggt 72 0 

ggccatggct ataccccata tgtttttaaa gagattgcga aagcactcaa agtgaagcaa 78 0 

cacgaacact tttgggatag gtatcatgtc aatgaaaaga tcaaaagttt ttttaaactc 84 0 

tatccagtgg aactgatgac tggcgctttt cagagcatta aacagcatga taaagaaaag 90 0 

cttagaactg ttttggatac gacagaagca ttgatactga tggaagaaga aatggaaggg 96 0 

tttaaccagt tcaaacgaaa gttgttaaac aattttcaat atactaaatc agctgaattg 102 0 

agaggtttca gtcgtgcagg aataggtgtt atggagtcac aacatcgaaa aattacctat 108 0 

cggatgaaaa agcgcgggat gtattggaca attcagggag ctgagacgat gagtcaatta 114 0 
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attgtcctat cgtatgaagg acagctaaga gatctcttct ttggttcttg gcgtgaagac 
tatcagaaat atcaagagct agaaaacctt agtgcgggaa aaatcaaaca tgaacagaat 
aaaatcaata agagatatga tctccagaca cttggtcggc ttaggtacgg taggcatcgt 
aatttatag 



1200 
1260 
1320 
1329 



SeqlD 441 
atggatgtat 
caatatgtgg 
gaaacgaaca 
gctgtccctg 
gcaactgaag 
cagaaactac 
cttctctggc 
gatgttgcga 
ccaacactta 
ctgaatacag 
aaaatattct 
atcaaccggt 
cttctcagaa 
cagaaggaca 
ttgccgctaa 
ttctttcttg 
gaatttttga 
aaaaataaaa 
gaatcaaaag 
cgtttagcca 
cagctggaaa 
catcctgaag 
aaggctggag 
ggtaaaggta 
acagaaaata 
gagttaaaag 
cgtgaattag 
gaattgcagg 
tttttagatt 
actcctaatt 
tttgaagata 
attaggcaac 
acgcttctga 
atctctattt 
tcggatgaag 
aaacgtctat 
gtttctaaac 
agaagcggaa 
caacaagcaa 
gaagaacgta 



cgtctagtcc 
actatactaa 
ggcaaatgat 
aaacgcagtt 
aacaatctaa 
gattaatcga 
aaggagttat 
ctggtcaagt 
tccagaaaga 
ataatatcca 
atcagccacg 
ttaaaagtgg 
aagaacagat 
agattacttc 
ataagaagtg 
atcgctattt 
aagagactag 
tctatgaaaa 
gatatgattt 
ataatatctt 
aaaatttaga 
caagtgtggt 
agcagccact 
taggacggcc 
ttcagctgaa 
agttagtaga 
gtacctatcg 
ttcgtcaaaa 
ttaagaaaca 
acaagctatc 
gtgttgcgct 
tttggactga 
aagggacafcc 
ttcagttgaa 
caacaataaa 
atgatacatt 
aattacaaga 
aaatcaatac 
gtaaaaaagc 
ggcaagctca 



gaatattaca 
cagagaagag 
tgaaggatta 
gaattttagg 
agaactaaca 
tctaaaaaat 
ttcgtttgat 
tgatcaaaaa 
gggcctttct 
tatccatttt 
tggacgtatg 
tgtgtatcat 
tttagctaat 
ttcagctgaa 
gcgctatggt 
agattcctat 
agattttctt 
actacgtaaa 
agaacatcat 
acgttcgttt 
gagtttttct 
aaaaagtcag 
tgaaattata 
agaatttgta 
aagtctaacc 
tgctgctaag 
ttatgcgttg 
gctactagaa 
gttaatagct 
agaagatgat 
acctgtttca 
gctaggattg 
aacgacgaaa 
atatcagatt 
ggagatgagg 
acagccatca 
acgaaaagtt 
agacttcatg 
attgatggaa 
acgttaa 



tttatgctgc 
gctgtcaaaa 
actgaagacg 
gaatacattg 
gctatcttta 
aagttggaat 
aatgcttttc 
gcgattaagg 
gattctgctt 
gggctttctg 
gagtacaaag 
ggattgctga 
ctcaaagcgg 
aaaaattttt 
tctaatgcca 
ttaaacaatg 
cagacttatg 
gttgatgggc 
ttggcacgtc 
agagaagctg 
gttttgaacc 
aaagcttggc 
aggccagtct 
tcagatactg 
ttgaaagacc 
ttaaagacta 
aaacttagca 
caggtacagc 
caggaattac 
aaagccttga 
aaagctactc 
gttcatcaca 
caagcttata 
aacaatagaa 
atagctaatg 
gatgatggtc 
atcaaaaaag 
cgacaattga 
cgtgcacgta 



aatacacaga 
ttgatgaaga 
agttgactcg 
attatatgaa 
ctcaagaagc 
cagcttatca 
ttgcggaaca 
cagtgatgcg 
tttggtgggg 
aagttgaatc 
gtaacttctc 
aagaagaaac 
acttcataac 
tggaacaagc 
gagattttgc 
aaggtagtgc 
aaggggttta 
aaacgatcag 
gtgtaatgga 
caccccaaat 
agaagaaaat 
agaagttagg 
ataaatctta 
tttatgatat 
tctctctgtt 
atccaacaga 
tattagaatc 
cactagcgtc 
aagctatagc 
aaaatcgatt 
ctggtgccat 
ttcaagatga 
tagaagaact 
acaagcagat 
ctaaaggttt 
aaaatcagat 
ctcaattaca 
cagcttctct 
gtgatgaacg 



ggctaatcct 
attgtcctta 
tattcaggaa 
ccgctcgfcat 
agattatctt 
aaatggttca 
gggattgtat 
tgatatgatg 
gaatatccat 
taaccgtgag 
tcagaaaacc 
aagatccaat 
atctatttac 
ctacaatcat 
ggttagtaag 
tgcctatcaa 
ttcagctgaa 
aacgcttgca 
tttaagagag 
tcaagacgtt 
tttagaacaa 
ctactttgtg 
tgataagcat 
tagtcagcta 
ttcttctaac 
gagagaacgc 
tagtcagaag 
tgatcaacca 
gctacaattg 
gaagaggcag 
acaacttcct 
aaacattcta 
tcagactcat 
aagccagtta 
ctctgagcta 
tagtcaagct 
acagacgcag 
taatcgttca 
tgaggaacaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2367 



SeqlD 442 

atgaattcta 

tgtgctattg 

gttacaactc 

actcaacttg 

cctgcagtta 

gctgttaaag 

gctgaaaatg 

aacaaggaag 

gcacgtaata 

aacaaagcag 

gcagaaaata 

tatcaagcta 

cgtaataaag 

caagctaaat 

tctattaata 

tatggaaact 

gatatgatcg 



acacaaaagg 
cattagcagg 
aagcaacaac 
ataaagcagt 
atcatgcgac 
atgttactgc 
caaaaattga 
gtcaagcagc 
aagcgaaaca 
agtctcaacg 
aagatgcgca 
aattagcgga 
atgcacaagc 
tagcacttta 
atgttgcttt 
caatcatgac 
atggtgtgaa 



tcacggattt 
tgcatttaca 
tcaaacagta 
agctacagct 
aactactgat 
aaaagcacaa 
tgctgaaaac 
cgtagatgca 
gcaagcgcaa 
tgtaagtcag 
agctaaagca 
aatcaaatca 
taaagcagat 
taatcaagct 
tgatatcaaa 
tgcaaaaact 
gacaatcggc 



ttccgcaagt 
ttagctacta 
acgcaaaatc 
aaaaaagcag 
gcacaagctg 
gctaatacac 
aaagcagagg 
cgtaacaaag 
gacgatcaaa 
ttaaatgcac 
aatgcgacta 
gttgaagctt 
gcgactaacg 
ctaaaagcta 
gctcaagcta 
aaacctgacg 
tatgggaaat 



caaaagcata 
gtcaagtgtc 
aagcagaaac 
ctgtagctgt 
atttagctaa 
aagctattaa 
cagagcgtgt 
caggtcaagc 
aagcaaaaat 
aaaataaagc 
atgcacaatt 
ataatgcagg 
cacagttaca 
aagcagaagc 
aaggtgttga 
gaagtttcga 
tgacaggtaa 



cggcttagta 
tgctgatcaa 
agtaacatca 
tacaaccaca 
tcaaacacaa 
agatgctact 
tgcaaaagaa 
agccgtagat 
tgatgctgaa 
aaaaattgac 
acaaaaagat 
tgtacgtcaa 
aaaagactat 
agataaacag 
taacgctgaa 
gtttaaccat 
agttaatcat 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
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cattatgttg ctaacaagga tggctctgtg acagcatttg ttgattctgt cactctttac 1080 

aagtacgagt atcgtaatgt tgctcaaaat gctgctgtta accaaaatat tgtatttaga 114 0 

gttttaacaa aagatggtcg tcctattttt gaaaaagctc ataatggtaa caaaactttt 12 00 

gcagaaactt taaacaaaac tttacaactc aatcttaaat atgagcttaa accacatgct 1260 

tccagcggta acgtcgaagt ctttaagatt catgatgact gggtacatga cacacatggg 13 20 

tctgctttag tgtcttatgt taataataat gatgctgttc ctaatgtggt catcccagaa 13 80 

cggccaactc caccaaagcc agtgaaagtt acacctgaag cagaaaaacc agtacctgaa 1440 

aagccagttg agcctaaatt ggfcaacgcct acattaaaaa cttatactcc agtcaaattt 15 00 

attccgcgag aatacaaacc agaaccaatc acccctgaga cgtttacccc tgagaaattt 15 60 

actccagctc aaccaaaagt gaaaccacat gtgtctattc ctgaaaagat taactactca 1620 

gfctagtgttc atcctgtttt agttccagct gctaatcctt caaaagctgt cattgatgaa 1680 

gcaggtcaat ctgttaatgg taaaaccgta ttaccaaatg caacattaga ctatgttgct 1740 

aaacaaaact ttagtcaata caaaggtatt aaagcttctg cagaagcgat cgcaaaaggt 1800 

tttgcatttg tagatcaacc aaatgaagcg ttagctgaat tgactgttaa gtctatcaaa 1860 

gcatctaatg gtgatgatgt atcaagcttg ttagaaatgc gtcatgtttt atcaaaagat 1920 

actttagacc aaaaacttca atctcttatt aaagaggcag gaattagtcc agttggtgag 198 0 

ttttacatgt ggactgcaaa agatccacaa gctttttata aagcttatgt tcaaaaagga 2 04 0 

ctagatatca cttataatct atcctttaaa atcaaagcta actttactaa aggtcaaatc 210 0 

aaaaatggtg ttgcacagat tgattttggg aatggatata caggtaatat tgtagtcaat 2160 

gatgttactg ttccagaagt acataaggat atacttgata aagaggatgg taaatcaatt 2220 

aataatagta cggttaagtt aggtgatgaa gtgacctaca aacttgaagg atgggttgta 2280 

ccagcaaacc gtggttacga tctttttgaa tacaaatttg tggatcaatt acaacacaca 234 0 

catgatcttt acttacgtga taaagtggtc gctaaagttg atgtgacatt aaaagatggt 2400 

acagtcatta aaaaagggac taatttagga gagtacacag aaacagttta caacaaaacg 246 0 

acaggtcatt atgagcttgc ctttaaaaaa gagtttttag ctaaggtttc tcgtgaatca 252 0 

gaatttggtg cagatgattt tattgtagtt aaacgtatta aggcaggtga tgtttacaat 258 0 

actgctgatt tatatgttaa tggatataaa gttaagtcag aagcagttgt gactcatact 264 0 

actgagaaat caaaaccagt tgaaccacaa aaagcaactc caaaagctcc agctaaagga 270 0 

ctgccatcaa ctggtgaagc tagtatgacg ccacttactg caattggagc aattatctta 2760 

tcagctctag gcctcgcagg ctttaaaaag cgtcaaaaat aa 2802 

SeqID 443 

atgtttagtt ggttggaagc tctttattac actctgatac aactggcaaa agtgaaccgg 60 

ttgaatgctc ttttcttagt tagcgttgtg ggttatcttt gttaccaggg aataaaactc 12 0 

gtcagaaaaa ccataagaaa cttttttcag ctgatgaagg gtttcatagg tgatagagag 18 0 

aacatcaaga aatgcatcaa aaacaagaaa gaggcactag tccattcttg gaaacatcgt 24 0 

caagatattg attggaaatc aactggaaaa gataagagta aacagttatg gaatcttatg 300 

aagcgacttg cgacagttgc tccttcattt ttgttcttat tattgggaaa tgttctcttt 360 

cgtctcattt atcaactgcc ttttgtaaag caagacagaa agcgatttga caaggaaatg 42 0 

aagcccttgc tctacttcaa gaactatcgt agttttgtgt tcatgggaat aggtttcagt 48 0 

ttcatagcgt ttattctcac aaactatttt gtgacggttt taagggctgc tattcgtttt 54 0 

ctatatttct caatcatgac gttaagagat aatagccaag tcgttagctt taacgttgat 60 0 

agtttgctca tccagaattt attcaatgct agggtatttg tgatagctcc cattctagca 66 0 

gtgccaatct ttctcattgg tttagtcgta gcttggcggt ctgcttgggt taactttgaa 72 0 

cagtaccgtg attataatca taatgaagaa ggggatgatc gctttgcgac tgtcaaagaa 78 0 

atccaccagc aatataagaa agttcctaat aaaacggaaa cttatccagg tgaaggaggt 84 0 

gtgcccgttc ttcatgaaac aagaaagaat ttgacaggct taacgcttaa atctcaaatg 900 

ctttggcaaa accgtacctt tagtcgctat ttaacgaatg cggaaaggat tttagggctc 96 0 

ttatcgacgc cttcaggaga ttattacatc gatgatagca ccacaaactt gatcaccatg 102 0 

gggattactc ggtcaggtaa gggagaagct catattgccc ctattattga tattaatagc 108 0 

cgtgcggaaa ttcaaccgtc actgattatt gcagacccta aaggggaaca ttaccagtcc 114 0 

tcttataaaa ccatgcgtcg gcgtggctat gatgttaatg tcctttcttt ccaaaacatg 12 0 0 

gattggtcca tgtcctataa ccctttagct cttgcgattg cagcagctaa gaagggttac 1260 

tatgaaatga cacagacaag ggttaatgcg gttgcagaag ccatttatcg taaaacgaaa 13 2 0 

cctggtagcg gtaatggcaa tgcaaaatac tgggaagata cctccatttc cctctttaat 13 8 0 

gccattgcta tggccttaat ggaccgtgct aatgaaaccg tcaggaatgg tgaaaccgat 144 0 

gcttgggata ccgttacagt tcgtaacatt gccaagtttt tgactgactt gggttctgaa 15 0 0 

gaagtctttg tcaatgattt tggagagatt gttgagaatc ctgataagaa ccaacaagtg 15 6 0 

aagaagaaat ctaaaatcac ggtttacttt gataacttgc gtaaaatcaa tcaagaacaa 162 0 

ttttccaaat ttagagatat ggctgattta aactttaggt cttctgactt cgcttcagaa 168 0 

gaaactaagg gaaatgtctt ctctagcatg atgtcaggta ttaacttatt cttgcaagat 174 0 

aatattgcta aactaacctc taaaaactct attgacctag aatcggttgg tttcccacgt 18 0 0 

cgcttgtcta tcaagtttcg ttctagttcc aatgtcgcta tgcgtaacga atacactcat 18 60 

aagacggcta aggttaccat tactagtcaa gctgtttggg gtaaaaccac taaacaagtt 192 0 

atccacgtag atgctgcaac agctctgatt gatggtgaag gctatctaac ctatgtgatt 198 0 
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gaaccccagc ttcctgatca attcttggta acaattgact ttaatcacga aaacaatggt 2 04 0 

ggttcagcta ttcgtcacaa aactttccaa ttctcagctg agaaagtcta taagaaacgt 2100 

ggtaacgtta ttacgttgga tgactacacg aaaaaaccag ttttggatca tatcaaagtt 2160 

actgttctca acaagcaaga tgataacctt ctccagaaag aagatattga cctgatttat 222 0 

tcagataatc ctaaagtgat ttacttggta acacctccaa ataggactga atataatagt 22 8 0 

attgtatctc tgtttttgga tcaattgttt aatgccaatt atgagttagc tctgtcaaat 2340 

ggtcgcaagt gtgtcaatcg aattcttcat atcctcgatg aattcacaaa cattccagct 24 0 0 

attcctcaca tggataccaa gatttccatt ggtcttggtc aaaatattct ctactatctc 2460 

tggattcaga acttgaaaca gttagtcagt gaatatggcg agaatacagc ggaaaccatt 252 0 

cgggagaact gttctttgaa agtttatatc aaatctactg ccccagcgac caacgagtac 2580 

ttcagtaaag agttggggac tcggaccatt acacgtcgca gaaggtcaag taatattcta 2640 

gatgaagcta atccaaatgt ttccattgaa aatcctagac aggaactctt aacaccgaca 27 00 

cagctctcga aactccaaga aggggaagcg gttattttgc gtggtgttaa aggtcgagac 2760 

aatgcaggtc ggaaaatcac aacggatccg attttcttgc atgagaaaac gagccttcct 2 820 

tatcgctaca tgttcttaca agaagaattt gaccagtcga tggctttggc agatattcca 28 80 

gtggaaagtg ggcataggga ccttgacctg caagatatag cagtaggggc acaaagcact 2 940 

tttaataaga ttattgattg gcggatggct ctaactgacc gtatgagaac aaatgggaag 3 0 00 

atacctcaat tagcatcaag aaaacaaacg attaaagctc taagtcaatc tcaatttact 3 0 60 

tctccagcag acctaacaca agctgtgatt gcagaggtat ttgatgagga agatgatgac 3120 

gatcttttct ttgtggatga tgtcatgtaa 3150 

SeqID 444 

atgacagata atcgctttgc ccaattaaaa gaaaactttg aaaagggatc tcctaaaagg 60 

cgagttccaa cgtctcgccc aatcgcagct caaaaagcgc ctgagagcta taacaaaaag 12 0 

ggacggtatc cattttcgct ccaccaagat gtgcgttatg ataaattaga agcattagta 18 0 

gcttatcatg gagctaagtc tgcatcagat tatctggaga ggttgattgt tcaggaatgg 240 

gaaaagatgc agcggaagct taagaacaaa gaaaaataa 2 79 

SeqID 44 5 

atggcatatt tatcaaaatt atccgatttg gatccatcgt tgatggatgc ggattctgaa 60 

caaatttata ttcctaaagt cttgtttgag cataatgact ttaaaggctt gacctacaaa 120 

gagattttat tatattcttt tttgttaaat cgtttaagag agccattaga ttttattcaa 180 

aaaggctatg atgataatga agatacctat gttcacttta aggtcgaaga tttatgtgaa 240 

ctactcaacc agagtaagac aaccgttatt tccttgaaga aaaggttagc tcaatatggt 3 00 

ttgatagaag aagtgaaagc aggtagtcac cagccgaatc gtatttattt aacagataaa 3 60 

ttagttccat atattaaggg gtaa 3 84 

SeqID 44 6 

atgacaatct ttgatgaaag agagttaaaa gaacgattta ctcatgaaaa tagggttagt 60 

tfcttatgagt ttgtggctaa atatgatgct caaatggttc ctgtgatgaa agcaaagggc 12 0 

tatcgatgta ttcattcaat ggagcgtaca gtggttttta cttttgggga attcacgatt 18 0 

aggcgacgtc ggtggcaaaa aggagaacat tgggtggtgc cagttgatga aaagctagga 24 0 

ctgaaaaaga atgttcgcta ctctttagaa ttcatgtatc aaattgctag tttagcaacc 300 

atgatgcctt atgaaaaagt gattaaggtt gttcagatga tgtattgtat tgtgattacc 3 60 

aaacctactg ttgtgaaggc cgttaagatt agtcgtgaat tgcttaaaga aaaggaagcg 420 

tatcgttttt ttgatgaaga tataccagta gataaagaac cagttgatat gatttatctt 480 

gaaggagatg gagtcatggt caaagctcga gaagaaggat tagataatcg caatgttgat 54 0 

ttgtctcatt tcgtggttca tacgggtagt cagaaagtag gaagcaatcg ctttgtctta 600 

caaaataaaa aagagtttgt gtcccttgat aatcgtcaaa cgcgtcaaaa gattttggac 660 

tacctttaca atcattttta cattgcccca aacaccttgc ttattaccaa ttcagatggt 720 

ggccatggct ataccccata tgtttttaaa gagattgcga aagcactcaa agtgaagcaa 780 

cacgaacact tfctgggatag gtatcatgtc aatgaaaaga tcaaaagttt ttttaaactc 840 

tatccagtgg aactgatgac tggcgctttt cagagcatta aacagcatga taaagaaaag 900 

cttagaactg ttttggatac gacagaagca ttgatactga tggaagaaga aatggaaggg 9 60 

tttaaccagt tcaaacgaaa gttgttaaac aattttcaat atactaaatc agctgaattg 1020 

agaggtttca gtcgtgcagg aataggtgtt atggagtcac aacatcgaaa aattacctat 1080 

cggatgaaaa agcgcgggat gtattggaca attcagggag ctgagacgat gagtcaatta 1140 

attgtcctat cgtatgaagg acagctaaga gatctcttct ttggttcttg gcgtgaagac 1200 

tatcagaaat atcaagagct agaaaacctt agtgcgggaa aaatcaaaca tgaacagaat 1260 

aaaatcaata agagatatga tctccagaca cttggtcggc ttaggtacgg taggcatcgt 1320 

aatttatag 1329 



SeqID 44 7 

atggctgtcg aaattattat gcctaaactc ggggttgata tgcaagaagg cgaaatcctt 
gagtggaaaa aacaggtagg tgatgttgtt aatgaaggtg acgtcctact tgaaattatg 



60 
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tcagataaaa 
catgggaatg 
gaagaagtaa 
gtaacttcag 
agtggtgaaa 
gaccttgcac 
agcttcaagg 
ggagttgata 
gtfcttggctg 
acaacagaag 
aaggcfcattt 
tatgacattg 
gctaaaacag 
ttgatgaaac 
gagcttcata 
cctgttgttc 
gttattaaga 
tcaattacta 
aactcagcaa 
attgtagctc 
atgaatggtg 
cttatttga 



caaatatgga 
gagatgttgt 
cagaagcatc 
aaccagaaaa 
aagtacgtgc 
tagtatctgg 
gtgctcaacc 
tagcagaaat 
ctatgtcacc 
aaaaacaatt 
ctaaaggtat 
atatgactga 
gcttgaaagt 
cagagcatcg 
attttgttaa 
acaatgctga 
aaactcaaga 
acttaggtat 
tccttggagt 
gaccaatcat 
ctaagtttat 



gattgaagca 
tcctgtaact 
ttcttctgaa 
agttgaagaa 
cactcctgca 
aacaggggca 
acgtattaca 
cactggcagt 
gcaagcagca 
accagaaggt 
gactaactct 
aatgatggct 
cagctttact 
atacctaaat 
tattggaatc 
tcaaatgtca 
aggtaaatta 
gtttggtact 
aggcgctact 
ggcaatgtgt 
ggttgatctt 
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gaagactcag 
gagacaattg 
aatacatcag 
acatcagaac 
gctcgtaaat 
aatggtcgcg 
ccattggcac 
ggtattcgtg 
gaggctcctg 
gttgaagtca 
tacctcactg 
cttcgcaaga 
gatttgattg 
gcttcattaa 
gcagttggtc 
ctttctgact 
aaatcagctg 
aagacgttta 
attccgactc 
ttaacaattg 
aagaacttaa 



gtgttttact 
gctatatcgg 
ttgaagaaaa 
caagtgtgcc 
tagcaagaga 
ttcacagaga 
gtcgaattgc 
gtaaaattgt 
tagaaacgaa 
ttaagatgtc 
ctccaagctt 
aattaattga 
gaatggccgt 
ttaacgatgc 
ttgatgatgg 
ttgttatagc 
aaatgtcagg 
acccaattat 
caacagtggt 
accaccgtat 
tggaaaaccc 



aaaaatcaca 
tgctgaaggt 
cgcgacacaa 
agcagctaca 
gatgtctatt 
agatgttgaa 
tgaagatcaa 
taaaaatgat 
agctacacca 
tgcaatgcgt 
tacacttaac 
tccaatcatg 
tgttaagaca 
acaagaaatt 
cttgattgtt 
atctaaagat 
ttcaacattc 
caatcaaccg 
tgatggtgaa 
tgttgatgga 
atttggttta 



180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1389 



SeqID 448 

afcgtctagtt 

gccaccacag 

gttattaatt 

gcttattcaa 

ggcaaaccaa 

actgctacac 

tttgtttaca 

gcgacagaat 

aataaacttg 

cctcaattta 

gaaaaagttg 

cttgttaagg 

tgggattcta 

actgcagttc 

gctatttata 

ttgaatttag 

cctgctgaat 

aagtatactg 

ggtcttgctg 

ccagcagcta 

ctaactgttg 

aattttgatg 

gggttattca 

aatgcctatc 

tacaaaactg 

gggaaaaaag 

aatgttgatt 



ttaatagaaa 
ttaccctggt 
ggtatattcc 
acttagctat 
aacccgattt 
ttcgtgataa 
catggcggag 
cgcacctttt 
gtgtaacagc 
aatattattt 
gtaaagatta 
attggaatgg 
aacatgttaa 
aaatgtacaa 
accaaactgg 
cgactgatcg 
ctctggtccc 
ccccaggtta 
aggttggtca 
aaaatgctgt 
aagaaaaatt 
tcgtgctctt 
cgactaattc 
aaaaggcaat 
ctgaaaaagc 
gactccaaaa 
fctacttatgc 



aaaacttaag 
tgcctgtggt 
tactgaaatc 
aggtaattca 
agctaaaaaa 
tttaaaatgg 
aatagttgat 
aaatgctgat 
taaaggcaat 
agccttctct 
tggtacaaca 
ttcaaatggt 
aacaaatagt 
gcaaggtcag 
ttcggttaaa 
taaaggtgtt 
taaaaaatta 
tacgtataat 
gtcatctctc 
tgactatgtg 
tgttactttc 
ctcatgggga 
tgcgtataat 
tactactgac 
tttatttgat 
ttctaaatta 
ttataagaca 



tttttgggta 
aatgagtcaa 
agtacattgg 
ggcagcaatt 
gtatccgttt 
tcagatggaa 
cctaaaactg 
aagattaatt 
caagtgacct 
aattttatgc 
tcaaaaaatc 
aaatttaaat 
gtcatagttc 
attgattttg 
gcattgacaa 
gttaaagcgg 
gccaaattac 
acaagtaaag 
aagttaacaa 
aaatctacat 
aagcaacgcc 
ggagattatc 
tatggtaaat 
gccttgaaac 
caatcatatt 

aa 99'9'9 tt 99" 
gaataa 



tatctttggc 
agaactctgg 
acatttcaaa 
tacttagaat 
cttctgatgg 
gcaaactaag 
catctgaata 
ctggagatat 
ttaaattgac 
ctcaaaaaca 
aaatttactc 
tagtcaaaaa 
aaactatcaa 
ctgaaatatc 
atcagaaaat 
cagttgatac 
ctaatggtga 
ctcaaaaact 
tcacagctga 
gggagagtgc 
ttgaagatgc 
cagaaggatc 
tttcttctaa 
caggagatgc 
ataacccagt 
taagaaattc 



gactctaaca 
agataataaa 
gaatacagat 
tgataaagag 
tcttacttat 
cgctgaagat 
tgcttacttg 
taaagattta 
aagtccatgc 
atcgtatgtt 
aggtccttac 
taaatattat 
aaaaccagat 
aggaacttct 
taggcaagct 
aggttcaact 
ggatctttcc 
ttttaaagaa 
ttcggactct 
tttgcctggt 
aaaaaatgaa 
aactttctat 
agagtatgac 
cgctaatgat 
ttattaccta 
aactggtttg 



60 
12 0 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1596 



SeqID 449 

MDVSSSPNITFMLQYTEi^PQYVDYTN^ 

RSYATEEQSKELTAIFTQEM)YLQKLRLI 

VMRDMMPTLIQKEGLSDSAFWWGNIHLNT 

LLKEETRSNLLRKEQILANLK^ 

SYLISnSTEGSAAYQEFLKETRDFLQTYEGV 

ILRSFREAAPQIQDVQLEKNLESFSVLNQKKILEQHPEASWKSQKAWQKLGYFVKAGEQPLEIIRPVYKSYDKHGK 

GIGRPEFVSDTVYDISQLTENIQLKSLTLKDLSL 

KELQVRQKLLEQVQPLASDQPFLDFKKQLIAQE 

IQLPIRQLWTELGLVHHIQDENILTLLKGT^ 

AKGFSELKRLYDTLQPSDDGQNQISQAVSK^ 

RARSDEREEQEERRQAQR 
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T TT p AVNHATT TDAQAELAJSIQTQAVKDVTAKAQANTQAI KDAT AENAKI DAENKAEAERVAKENKE GQAAVDARNKA 
GQAAVDARNKAKQQAQDDQKAKIDAENKAE S QRVS QLNAQNKAKIDAENKDAQAKANATNAQLQKDYQAKLAE IKS V 
E AYNAGVRQRNKD AQAKADATNAQLQKDYQAKIiAL YNQALKAKAEADKQ S INNVAFD I KAQAKGVDNAE YGNS IMT A 
KTKPDGSFEFNHDMIDGVKTIGYGKLTC 

GRPI FEKAHNGNKTFAETLNKTIaQLNLKYELKPI^ PERP 
TPPKPVKOTPEAEKPVPEKPVEPKLWPTLKTYTPVKFIPREYKPEPITPETFTPEKFTPAQPKVKPHVSIPEKI^ 

S VSVHPVLVPAANP S KAVIDEAGQ S VNGKTVLPNATLDYVAKQNF S QYKGIKASAEAI AKGFAFVDQPNEALAELTV 

KS IKASNGDDVSSLLEMRHVLSKDTLDQKIjQSLIKEAGI s pvgefymwtakdpqafykayvqkglditynls FKIKA 

NFTKGQIKNGVAQIDFGNGYTGNIWt^^ 

YKFVDQLQHTHDLYLRDKVVAKVDVTL^ 

I VVKRIKAGDVYNTADL YVNGYKVKS EAVVTHTTEKSKPVEPQKATPKAPAKGLP STGEASMT PLTAI GAI I L S ALG 
LAGFKKRQK 

SeqID 451 

MF S WLEALY YTL I QLAKVNRLNALFLVS VVGYL> CYQGIKLVRKT IRNFFQLMKGFI GDRENIKECCI KNKKEALVHS W 
KHRQD IDWKSTGKDKS KQLWWLMKRLiATVAPS FLFLLLGITVLFRLI YQLPFVKQDRKRFDKEMK^ 
MGIGFSFIAFILTNYFOTVLRAAIRFLYFSIMTLRDNSQW 

WRSAWVNFEQYRDYNHNEEGDDRFATVKEIHQQYKKVPNKTETYPGEGGVPVLHETRKNIiTGLTL 

RYLTNAERILGLLSTPSGDYYIDDSTTNLITMGITRSGKGEAHIAPIIDINSRAEIQPSLIIADPKGEHYQSSYKTM 

RRRGYDVNVLS FQNMDWSMS YNPIxALAIAAAK^ SLFNAI 

AMALMDRANETVRNGETO^^ 

QFSKFRDMADLNFRSSDFASEETKGIWFSSM^^ 

EYTHKTAKVTITSQAVWGKTTKQVIHVDAATAL 

EKVYKKRGNVITLDDYTKKPVLDHIKVTV^ 

NAJSTCEI^SNGRKCVNRILHILDEFTNIPAIPH^ 

VYI KS TAPATNE YFS KELGTRT ITRRRRS SNI LDEANPNVS IENPRQELLTPTQLSKLQEGEAVI LRGVKGRDNAGR 

KITTDPIFLHEKTSIiPYRYMFLQEEFDQSMALADIPVESGHRDLDLQDIAVGAQSTFNKIIDWRM^ 

PQIiASRKQTIKALSQSQFTSPADLTQAVIAEVFDEEDDDDLFFVDDVM 

SeqID 452 

MTDNRFAQLKENFEKGS PKRRVPT SRP I AAQKAPES YNICKGRYPFS LHQDVRYDKLEAIjVAYHGAKSASD YLERL IV 
QEWEKMQRKLKNKEK 

SeqID 453 

MAYLSKLSDLDPSLMDADSEQIYIPKVLFEHND 

LCELLNQSKTTVISLKKRIiAQYGLIEEVKAGSHQPNRIYLTDKLVPYIKG 
SeqID 454 

MTIFDERELKERFTHENRVSFYEFVAKYDAQMVPWKAKGYRCI^ 
KLGLKKNVRYSLEFMYQIASIjAT^ 

VDMIYLEGDGVMVKAREEGLDNR1TVDLSHFVVHTGSQKVGSNRFVLQNK 

TLLITNSDGGHGYTPYVFKEIAKALKVKQHEHFWDRYHVNEKIKSFFKLYPVELMTGAFQSIK^ 
EALILMEEEMEGFNQFKRKLnOTtfFQYTKS 

EGQLRDLFFGSWREDYQKYQELENLSAGKIKHEQNKINKRYDLQTLGRLRYGRHRNL 
SeqID 455 

MDVSSSPNITFMLQYTEAI^QYVDYTNREEAVKIDEELSLETNRQMIEGLTEDELTRIQEAVPETQLNFREYIDYMN 
RSYATEEQSKELTAIFTQEADYLQKLRLIDLKNKLESAYQNGSLLWQGVISFDNAFLiAEQGLYDVATGQVDQKAIKA 

VMRDMMPTLIQKEGLSDSAFWWGNIHL^ 
LLKEETRSNLLRKEQILANLKADFITSIYQKDK^ 

SYLNNEGSAAYQEFLKETRDFLQTYEGVYSAEKNKIYEKLRKVDGQTIRTLA^ 
ILRSFREAAPQIQDVQLEKNLESFSVLNQKKILEQHPEASVVK^ 
GIGRPEFVSDTVYDISQLTENIQLKSLTLKDLSLFSSN^ 
KELQVRQKLLEQVQPLASDQPFLDFKKQLIAQELQ 

IQLPIRQLWTELGLVHHIQDENILTLLKGTSTTKQAYIEELQTHISIFQLKYQINNRNKQISQLSDEATIKEMRIAN 

AKGFSELKRLYDTIiQPSDDGQNQISQAVSKQLQERKVIKKAQLQQTQRSGKINTDFMRQLTASLNRSQQA 

RARSDEREEQEERRQAQR 

SeqID 456 

MNS NTKGHGF FRKS KAYGLVCAI ALAGAF TLAT S QVS ADQVTT QATT QTVTQNQAETVT S TQLD KAVATAKKAAVAV 

TTTPAVNHATTTDAQADLANQTQ 

GQAAVDARNKAKQQAQDDQKAKIDAENKAESQR 

E AYNAGVRQRJSJKDAQAKADATNAQLQICDYQAKIiALYNQALKAKAE ADKQS INNVAFDIKAQAKGVDNAE YGNS IMTA 
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KTKPDGSFEFNHDMIDGVKTIGYGKLTGK^ 
GRPIFEKAHNGNKTFAETLNCT 

TPPKPVKVTPEAEKPVPEKPVEPKLVTPTLKTYTPVKFI PREYKPEPITPETFTPEKFTPAQPKVKPHVS I PEKINY 
S VSVHPVLVPAANPS KAVIDEAGQS VNGKTVLPNATLD YVAKQNFS QYKGI KAS AEAI AKGFAFVDQPNEALAELTV 

KSIKASNGDDVSSLLEMRH^ 

NFTKGQIKNGVAQIDFGNGYTGNIVVTO^ 

YKFVDQLQHTHDLYLRDKWAIW^^ 

IWIOlIKAGDVlTSrT^ 

LAGFKKRQK 

SeqID 457 

MFS WLEAL YYTL IQLAKVJSTRLNAIjFIiVS VVGYLC YQGI KLVRKTI RNFFQLMKGF IGDRENI KKC I KNKKE ALVHS W 

KHRQDIDWKSTGIODKSKQLV^^ 

MGIGFSFIAFILTNYFVTVLRAAIRFLYF^^ 

WRSAWVNFEQYRDYNHNEEGDDRFATVKEIHQQYKKVPNCT 

RYLTNAERILGLLSTPSGDYYIDDSTTNLITMGITRSGKGEAHIAPIIDINSRAEIQPSLIIADPKGEHYQSSYKTM 
RRRGYDVX^SFQNlVnDW^ 

AMALMDRANETVRNGETDAWDTVT VRNI AKFLTDLGS EE VF VNDFGE I VENPDKJSFQQVKKKS KI T VYFDNLRKINQE 
QF SKFRDMADLNFRS SDFASEETKGNVFS SMMSGINLFLQDNIAKLTSKNS IDLESVGFPRRIiS I KFRS S SNVAMRN 
EYTHKTAKVTITSQAWGKTTKQVIHV^^ 
EKVYKKRGlWITLDDYTKKPVLDm 

NAJST^EIiAXjSNGRKCVNRILHILDEFTNIPAI PHMDTKI S IGLGQNILYYLWIQNLKQLVSEYGENTAETIRENCSLK 
VYIKSTAPATNEYFSKELGTRTITRRRRSSNILDEANPNVSIENPRQELLTPTQIiSKLQEGEAVILRGVKGRDNAGR 
KITTDPIFLHEKTSLPYRYMFLQEEFDQSMAL^ 

PQIiASRKQTIKALSQSQFTSPADLTQAVIAEVFDEEDDDDIiFFVDDVM 
SeqID 458 

MTDNRFAQLKENFEKGSPKRRVPTSRPIAAQKAPESYNKKGRYPFSLHQDVRYDKLEALVAYHGAKSASDYLERLIV 
QEWEKMQRKLKNKEK 

SeqID 459 

MAYLSKLSDLDPSLMDADSEQIYIPKVLFEHNDFKGLT^ 
LCELLNQSKTTVISLKKRLAQYGLIEEVKAGSHQPNRIYLTDKLVPYIKG 

SeqID 460 

MTI FDERELKERFTHENRVS F YE FVAKYDAQMVPVMKAKGYRCIHSMERTVVFT FGE FT I RRRRWQKGEHWWPVDE 

KLGLKKKPTRYSLEFMYQIAS!!^^ 

VDMIYLEGDGVMVKAREEGLDNRNVD 

TLLITNSDGGHGYTPYVFKEIAKALKVKQHEH 

EALILMEEEMEGFNQFKRKLLltflSrFQYT^^ 

EGQLRDLFFGSWREDYQKYQEIiENLSAGKIKHEQNKINKRYDLQTIiGRLRYGRHRNL 
SeqID 461 

MAVEI IMPKLGVDMQEGE I LE WKKQVGDVVNEGDVLLEIMSDKTNME I EAEDS GVLLKITHGNGDVVPVTETI GYIG 
AEGEEVTEASSSENTSVEENATQVTSEPEKVEETSEPSVPAATSGEKV^ 
HREDVESFKGAQPRITPLARRIAEDQGVDIAEITGSGIRGKIVKlSrDVLAAMS 
EVIKMSAMRKAISKGMTNSYLTAPSFTLNYDIDMTEN^ 

LNASLINDAQEIELHNFVNIGIAVGLDDGLiIVPVVHNADQMSLSDFVIASKDVIKKTQEGKLKSAEM 
GMFGTKTFNP I INQPNSAILGVGAT I PTPTWDGE I VARP IMAMCLTIDHRI VDGMNGAKFMVDLKNLMENPFGLLI 

SeqID 462 

MS S FNRKKLKFLGI SLATLTATTOT 
DKEGKPKPDLAKKVSVSSDGLTYTATLRDNL 

GDIKDLNKLGOTAKGNQVTFKLTSPCPQFKYYLAFSNFMPQKQSYVEKVGKDY 

FKLVKNKYYWDS KHVKTNS VI VQTI KKPDTAVQMYKQGQ IDFAE I SGT SAI YNQTGS VKALTNQKIRQALNIjATDRK 
GWKAAVDTGSTPAESLVPKKiQAKLPNGEDL^ 

AVDYVKSTWESALPGLTVEEKFVTFKQRLEDAKNENFDVVLFSWGGDYPEGSTFY 
YQKAITTDALKPGDAAITOYKTAEKALFDQSYY^ 

SeqID 463 

atgcttaaaa aatcgcactt tttacagata tttacacttt gcttagccct cttaacgatt 60 

tctggttgtc aattaaccga tactaaaaaa cctggtcata ccacaattaa ggttgctgcc 12 0 

caaagttcta cagagtctag tatcatggca aatattgtca ccgaattaat tcatcacgaa 18 0 

ttaggataca acacaacttt aataagcaat cttggttcct ctacggttac tcaccaagct 24 0 

ttgctccgtg gtgatgctga cattgctgcc acacgttata caggaacaga catcacagga 300 
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actcttggct taaaagctgt taaagacact aaagaagctt ctaagattgt aaaaactgaa 3 60 

ttccaaaaac gctacaatca aacttggtat cctacttatg gtttttctga tacttatgca 420 

ttcatggtta ctaaagagtt tgccagacag aataaaatca ccaagatctc tgatctcaaa 48 0 

aagttatcaa caactatgaa ggcaggggtt gatagttcat ggatgaatcg cgagggagat 540 

ggatacactg atttcgctaa aacatacggt tttgaatttt caaatattta ccctatgcaa 600 

attggcttag tctatgatgc agttgaaagt aacaaaatgc aatctgtatt aggctactcc 660 

actgacggtc gtatttcgag ctatgattta gaaattttaa gggatgataa aaaattcttt 720 

cctccttatg aagcctctat ggttgtcaac aattctatca tcaaaaaaga tcctaaacta 780 

aaaaaattac tccatcgact cgatggtaaa atcaatttaa aaacgatgca aaaccttaat 840 

tatatggtag atgataaact tttagaacct tcagttgttg ccaaacaatt tttagaaaaa 900 

aaccattatt ttagaggaga taaataa 927 

SeqID 464 

ttgaaaccgt ccttcaaaaa actattgcta ctatttagta ttatcaccat tttatccatc 60 

gcctgtacac cacatgcaaa agcaagtggg cgttcatgga aaagttggtt tatcgaacaa 12 0 

tacttttggc taaagcgcga taaaagttat tataaagttc aagatgaaag ctcatttcaa 18 0 

aagtacttaa atgctagccg tgaacaatct gataagggtt attacttaga cccaaactct 24 0 

gttaatggag ggcttgttca agagagactt tttgatatgc aagtctattc ttggaatgac 300 

aacggtaaag ctaatcaaaa gaccattttc tatttagctg gtggctctta cttaaataat 3 60 

ccaacgcctt accatatcag catgcttaag acgctctcaa ctagtcttga tgcaaaaatt 42 0 

attttaccaa tttatccaaa aactcctcgt tatacctatg attatgctat ccctagattg 48 0 

gttaaccttt atagacactt tcatgaaaaa aacgctaatc tcactttgat gggtgattca 54 0 

gctggcggag gattagcact aggattagcg catgcattgt ctcaccaatc agggcaagag 60 0 

gctattcctc aacctaaaaa tattatcctc ttatctccat ggctagatgt gacaatgaaa 660 

catccagaaa ttcccaaata cgaagatact gatcctatcc tatctgcttg gggtctcgca 72 0 

cgtgtaggtg aaatttgggc aaatggttct aataatacta accatacata tgttagtcct 78 0 

aaaaatgccc ctgctactaa actcgctcct attacgttat ttactggaac acgagaaata 84 0 

ttcttcccag atattcgcga ttatgctgca caactccaag cagctaatca ccccgttaac 900 

tatattgcac aagagggaat gaatcatgtt tatccaattt accctattga agaagctaaa 96 0 

acagcacagt atcaaatgat tgatatcatc aataaaactc cttaa 1005 

SeqID 465 

atgacaaaaa aacatcttaa aacgcttgcc ttggcactta ctacagtatc agtagtgaca 60 

tacagccagg aggtatatgg attagaaaga gaggaatcgg tcaaacaaga acaaacccag 12 0 

tcagcttcag aagatgattg gttcgaagaa gataatgaga ggaaaacaaa tgtttctaaa 18 0 

gggaattcta ctgttgatga aacagttagt gatttatttt ctgatggaaa tagtaataac 24 0 

tctagttcta aaaccgagtc agtggtaagt gaccctaaac aagtccccaa agcaaaacca 300 

gaggttacac aggaagcaag caattctagt aatgatgcta gcaaagtaga agtaccaaaa 3 60 

caggatacag cttcaaaaaa ggaaaccata gaaacatcaa cttgggaggc aaaagatttc 42 0 

gtaactagag gggatacttt agtaggtttt tcaaaatctg gaattaataa gttatctcaa 480 

acatcacact tagttttacc aagtcatgca gcagatggaa ctcaattgac acaagtagct 54 0 

agctttgctt ttactccaga taaaaagacg gccattgcag aatatacaag taggctagga 60 0 

gaaaatggga aaccgagtcg tttagatatt gatcagaagg aaattattga tgagggagaa 66 0 

atatttaatg cttaccagtt gactaagctt actattccaa atggttataa gtctattggt 72 0 

caagatgctt ttgtggacaa taagaatatt gctgaggtta accttcctga aagtctcgag 78 0 

actatttcag actatgcttt fcgctcacatg tctttaaaac aagtaaagtt accagataac 84 0 

ctgaaggtca ttggagaatt agcttttttt gataaccaga ttggtggtaa gctttacttg 900 

ccacgtcact tgataaaatt agcagaacgc gctttcaaat ctaatcgtat tcaaacagtt 960 

gaatttttgg gaagtaagct taaggttata ggagaagcaa gttttcaaga taataatctg 102 0 

agtaatgtta tgcttccgga tggacttgaa aaaatagaat cagaagcttt tacaggaaat 108 0 

ccaggagatg aacattacaa caatcaggtc gtattgcgca caaggacagg ccaaaatcca 114 0 

catcaacttg cgactgagaa tacttacgtc aatccggaca aatcattgtg gcgtgcaacg 12 0 0 

cctgatatgg attataccaa atggttagag gaagatttta cctatcaaaa aaatagtgtt 126 0 

acaggtfcttt caaataaagg cttacaaaag gtaagacgta ataaaaactt agaaattcca 1320 

aaacaacaca atggcattac tattactgaa attggtgata acgcttttcg caacgttgat 13 8 0 

tttcaaagta aaactttacg taaatatgat ttggaagaag taaagctccc ctcaactatt 144 0 

cggaaaatag gtgcttttgc ttttcaatct aataacttga aatcctttga agcaagtgaa 15 0 0 

gatttagaag agattaaaga gggagccttt atgaataatc gtattggaac tctagacttg 156 0 

aaagacaaac ttatcaaaat aggtgatgct tctttccata ttaatcatat tcacgccatt 162 0 

gttcttccag aatctgtaca agaaatagga cgttcagctt ttcgacaaaa tggtgcactt 168 0 

cacctgatgt ttatcggaaa taaggttaaa acaattggtg aaatggcttt tttatccaat 174 0 

aaactggaaa gtgtaaatct ctctgagcaa aaacaattaa agacaattga ggtccaagct 18 0 0 

ttttcggata atgcccttag tgaagtagtc ttaccgccaa atttacagac tattcgtgaa 18 6 0 

gaggctttca aaaggaatca tttgaaagaa gtgaagggtt catctacatt atctcagatt 192 0 

gcttttaatg cttttgatca aaatgatggg gacaaacgct ttggtaagaa agtggttgtt 198 0 
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aggacacata ataattctca tatgttagca gatggtgagc gttttatcat tgatccagat 2 040 

aagctatctt ctacaatggt agaccttgaa aaggttttaa aaataatcga aggtttagat 2100 

tactctacat tacgtcagac tactcaaact cagtttagag aaatgactac tgcaggtaaa 2160 

gcgttgttat caaaatctaa cctccgacaa ggagaaaaac aaaaattcct tcaagaagca 2220 

caatttttcc ttggtcgcgt tgatttggat aaagccacag ctaaagctga gaaggcttta 2280 

gtgaccaaga aggcaacaaa gaatggtcat ttgcttgaga ggagtattaa caaagcggta 2340 

ttagcttata ataatagcgc tattaaaaaa gctaatgtta agcgcttgga aaaagagtta 2400 

gacttgctga cagatttagt cgagggaaaa ggaccattag cgcaagctac aatggtacaa 2460 

ggagtttatt tattaaagac gcctttacca ttgccagaat attatatcgg attgaacgtt 2520 

tattttgaca agtctggaaa attgatttat gcacttgata tgagtgatac tattggcgag 2580 

ggacaaaaag atgcatatgg taatcctata ttaaatgttg acgaggataa tgaaggttat 2 64 0 

catgccttgg cagttgcaac tttagctgat tatgaaggtc tttatatcaa agafcatttta 2 700 

aatagttccc ttgataagat taaagcaata cgtcagattc ctttggcaaa atatcataga 2760 

ttaggatttt tccaagctat ccgaaatgca gcggcagaag cagaccgatt gattcctaag 2 820 

acacctaagg ggtacctaaa tacagtccca aattatcgta aaaaacaagt ggagaaaaat 2 880 

tcaaaaccag ttgattataa aacgccgatt tttgataagg ctttacctaa tgaaaaggta 2 940 

gatggtgata gagtggctaa aggtcataat ataaatgcgg agactaataa ttatgtagct 3 000 

gtaacaccaa taaggtccga gcagcaatta cataagtcac agtctgatgt aaatttacct 3 060 

caaacaagtt ctaaaaataa ttttatatac gagattctag gatacgttag tttatgtttg 3120 

cttttcctag taactgctgg gaaaaaagga aaacgagcaa gaaaataa 3168 

SeqID 466 

atgaacaaaa aaattgccgg gatcggcttg gcttcgattg cagtacttag tttagctgca 60 
tgtggacatc gtggtgcttc taaatctggt ggtaaatcag atagcttgaa ggttgcaatg 120 
gtaacagata ccggtggtgt tgatgataaa tcatttaacc aatctggttg ggaaggtatg 180 
caagcttggg gcaagaagaa tggccttaaa aaaggagctg gttttgacta tttccaatcg 240 
gcaagtgaat ctgattatgc aactaactta gatacagctg tgtctagtgg ttataaattg 3 00 
attttcggta ttggattttc tcttcatgat gctattgata aagcagcaga caataacaaa 3 60 
gatgttaatt acgtcatcgt tgatgatgtt attaaaggga aagataatgt tgcaagtgtt 42 0 
gtctfctgcgg ataatgaatc agcttactta gcaggtattg cagccgctaa aactaccaaa 48 0 
acaaaaacag ttggctttgt aggtggtatg gaatctgagg ttattacccg ttttgaaaaa 540 
ggttttgaag caggtgtcaa atcagttgat aaatcaatta aaattaaagt tgactatgct 600 
ggttcattcg gtgatgctgc taagggtaag acaattgcag ccgcacaata tgcttctggc 660 
gcagatattg tttatcaagt agctggtgga actggtgcag gtgtattcag tgaagcaaaa 720 
tcacgcaatg aatctctaaa agaagctgac aaagtttggg tattaggtgt agaccgtgac 780 
caagcagcag aaggtaagta tacttctaaa gatggtaagg cttctaactt tgttttggct 840 
tcatcaatta aagaagttgg aaaatcagtt gaattgattg ctactaaaac aagtaaaggt 900 
aaattcccag gtggtaatgt aacaacttat ggtttgaagg atggtggagt tgatatagca 960 

actacaaacc tttcagatga cgccgttaaa gcgattaaag aggctaaagc taaaattatt 1020 

tcaggagata tcaaagttcc tagcaaataa 1050 

SeqID 4 67 

atgaggattc aaacgacgaa ccttgataaa ggtggcggtg gaaaaagcag tcacacctat 60 
aacgaaggag attggttgtc tagagttatg gggaaacgag tcttgttgat tgatggggcc 120 
agagaatgta atttaacaca ctcctttgat gtgacgagcg acaaaacaat ctatgatatt 180 

tttatcacag gagaatatga gatttgtcga atcaatgata agcttagctt aattagagga 240 

gatgaacgac taactgatga acagctagat ttagccagtc gaaataataa ataccttcag 3 00 

ttattcatgt ggttcagtga gcattatgag gaactggccg aacagtttga tattatcttg 3 60 

attgatactc ataatgatga aagtcttgtt actgccaatt gtattgccgt ttctgatatt 420 

gtcctagggg tgactgatgc gtcaactaat ggatttcgtg cctggttagc ccttaaaaag 480 

tttgtcgatg acatcaagaa tgaagctatt gaggttatca caaagaaaag ctatgtgaag 540 

gcagaagctt atctgattgg caataagatt gagtattatg gaaataatgt gacagagact 600 

tgtaagcaat tcctggatgt tgtgcaggaa gatgaagctt atttgggttc aattcagaag 660 

aaagaactga tggccaaaag tttagtcatt aatcagagtg tctttgaaca acgtgaacac 720 

atgacagaaa aacagaagca agatcatcag aaattttatg ataatataga atatgtctat 780 

caaaatattt taaccgtact agaaggagag actaaatga 819 

SeqID 4 68 

gtgaaaaaag ttgttttctt gaaaatactt gttccgcttg tctctttggt ctttcttatt 60 

agcttgggta taacgatatt agcaggtgtt ttaggagcgg ttggtggttc gcagtcaaat 12 0 

tgtacgatag aagttgtaac ctcaacctct acagactcgt ccatatcttc aggtgatggt 180 

tccattgatt cttttgttaa ggagcacaaa gaggcttata tcctgtcttg gaaagctggt 240 

ggcttcttac catctgcatc gattacgcag accatgattg aaaatggctt taacttcacc 3 00 

aatcctaatg ggacctccct ctggcaagcc cataatatgg gaggggttaa aacgtctagt 3 60 

aagagtaact tccccgttac cttagcaact tatggccaag atagtgttga cctttcagga 420 
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acaaagccag 
tatgatgcag 
gcaattaaca 
actgacggaa 
gaatggctgg 
gcttcagctg 
gattcttctt 
tggggatata 
tcgcttggca 
gctggacaat 
tctgtcagtg 
gtcaaaacga 
ggccatactg 
aatacgccat 
tggacacctg 
cctaaacttg 



gagcaagagt 
ggattgtcgg 
atatagatgg 
cttataaaac 
ataaggaagc 
gagatgctgg 
caggtggtgc 
ggcctgataa 
tggcttatgc 
gtgttaattt 
gcaatggaaa 
ctcctaaaag 
ggattgtgtg 
tatcgggagc 
cccaacaaaa 
gaaactaa 



cggcgatggc 
caaggcagag 
gatagctaca 
caagcttatt 
tatcagcgct 
actatctgat 
cactgatggt 
cctaccagag 
aaactcttct 
aacaatcagt 
agaecaagct 
aggagctatc 
ccatgtcttt 
tgattatttc 
agcagaagtc 
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acaggtgggg 
tttatggccc 
ctaagtgcga 
gacatgtatc 
catggtgata 
attfcttacaa 
acaggaacag 
agtctgaagc 
ggctggttta 
cttggaaatc 
ggtgcttggg 
ttttctaccc 
gaaaatggat 
aaaaaatctt 
attagttttg 



cttatacctg 
atcaaactct 
ttgcggatgg 
ataagcttgg 
aaccttataa 
gcaatagtaa 
ttccagcaga 
agtatatcat 
atccaggfcag 
acctctgggg 
ccaatatctt 
aaaacggagg 
caatcttgat 
acacatggaa 
cttacccata 



gttttcaagc 
ctataccaag 
tggttgggcg 
taaaacctat 
agcaaccaca 
agattgtagt 
tgcgatagcg 
tgatcctaaa 
tgattatcta 
tcactcggga 
tggcaattca 
tggcggttat 
tgtggaacaa 
ttaccgtatt 
tggtaaagaa 



480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1338 



SeqID 4 69 

atgggaaaga cattttggaa agatatctat cgatcaatta cgacttcaaa aggtagattt 60 

tcatcaatcc tct tact tat gatgttaggc tcatttgctt ttataggact gaaggtctca 120 

gcacctaata tgcaaagaac agcacaaaat taccttgctc atcatcatgt tatggatatt 180 

accgttttca attcttgggg acttgacaaa catgatcaaa ctgttttaga aagcttaaaa 24 0 

ggatcacaag ttgaattttc ttactttgta gatacaaccc ctcaacagaa tagtaaatct 300 

tatcgattgt attccaatac caaaacaata tcaacctttg atcttgttaa gggacgtctc 3 60 

cctttaaata aatcagaaat tgctctctca tttcaggaac gaaaaaaata tgcaataggt 420 

gataaaatta attttaaaca agataaaaac aaactgtttt caaatacagg tcctctaact 480 

attgtaggtt ttgttaattc taccgagatt tggtctaaga caaaccttgg tagttctcaa 54 0 

acaggggacg gcgacttaga tagttatggc gtattagata agacagcatt tcactctcca 60 0 

gtatatacaa tggcaagggt aacttttaaa gatcttaggt tgattaaccc tttctcaata 66 0 

agttacaaag aaaaagtagc gaaataccaa gaaaaggtat cgcgaaaatt aaatattcac 72 0 

aataaaataa gatataccaa aacaaaaaaa gagagcctac gtaaaatcga tgaggaggaa 78 0 

aaaagcctac taaaagctca aaaacaaatc aatcggttag ataatgatag cctagcgatg 84 0 

ccactatctc aaagacaagc tattcaaatg aaaattaaac aagaccgact atctcttttg 90 0 

aaacgaacaa aagagcttct aaaattgaga cataacaccc aaataatgga atcgcctcaa 96 0 

attattgtct acaaccgtac tacctttcct ggaggacaag gatataacac ttttgactcc 102 0 

agtacaaata gcacttctaa aatcagtaat cttttcccca ttattttata tttagtcgca 108 0 

gcattagtaa ccttaacaac aatgactagg ttcgtcgagg aagaaagaac caacgcaggc 114 0 

atattaaaag cactaggcta tagtgaccgc caagttatct ttaaatttat tatttatggt 120 0 

tttatagcag gaacattagg gactactttg ggtatcattg gagggcatta tcttttacct 126 0 

cgtattattt ctgatatcat ttccaaagac ctgactattc caaacacaca gtatcatctt 132 0 

ttcttaaatt acagtttact agcctttgtt ttttcattat taagcattgt tcttccagtt 13 8 0 

tttgttatca ctcgacgtga gttgaaagaa aaagcagcat ttttattatt accaaaacca 144 0 

ccagctaaag gatctaaaat agctfctagag tatatcaatt ggatttggaa aaaactgtcc 150 0 

ttcacacaaa aagtaactgc tagaaatatt tttcgttaca aacaaaggat gattatgacc 156 0 

atatttggag ttgctggatc agtagcgctc ttattttcag gtctaggaat acaatcctcc 162 0 

ttaaagcaaa ccgtcaatga acatttcggt cgtattatgc cctatgatat attgctaaca 168 0 

tataacacaa atgcttctcc cccaaaaata cttgaactac tttcaaaaga ttcaaaaata 174 0 

gacaaatacc aaccaataca ccttgaaaat cttgacgaat ccattcctgg acaaattaat 18 0 0 

aaacagtcaa tttccctatt cataacagac aaaaaacaat tactaccttt tatttattta 186 0 

caagaagcaa caactaataa gtcattgcac ttgaataata aagggattat tatttctaaa 192 0 

aaattagctc aattttatca tgttaatacc ggtgatttta tccatttatc tcattcacaa 198 0 

acacttcctt ctagaaaatt aaaaataaca ggagttgtca atgcgaatgt tggtcactat 204 0 

atttttatga caaaacaata ctatcgaact atttttaaga aagaaactaa agataacgct 210 0 

tttttggtta agttaaccaa acataaaatc gcaaataact tagcagaaaa acttttagaa 216 0 

attaatggag ttgagtctct tacgcaaaat gctcttcagc tagctagtgt agaagccgtt 222 0 

gtgcgttctt tagatggctc tatgactatt ttagttgtcg tatctttatt gttagccatt 228 0 

gtaattctct ataatctcac taatattaat ttagcagaac gtaagcggga gctatcaact 234 0 

ataaaggtat taggttttta taatgaagaa gtaactttat acatttaccg cgaaaccatt 240 0 

attttatcaa ccatcggtgt gattctaggc accattagcg gtacttattt acaccgtcaa 246 0 

atgatgttgc taattggttc agatcaaata ctttttggtg aaaaagtatc accaactaca 252 0 

tttataatac caataagcgt agtagtcatc attctaataa gtctatgttt tatagttaac 258 0 

catcaattaa aaaaactcaa tatgttagat gccttgaaat cagtagatta g 2631 

SeqID 470 

atgaataaaa gaaaattatt aacagtctta tcaattgcat cagttgcagt tttatcaagt 60 
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gttacttttg cagatgaaat aactccatct gatccaacag caccatcaac accccttatt 120 

actccaaccc cagtagatcc tggaacacca actgatacga caaaacctag tactccagta 180 

gatcctggaa cgccaactga tacgacaaaa cctagtactc cagtagatcc tggaacacca 240 

actgatacga caaaacctag tactccagta gattcaggga cttcgactaa tccctcagat 3 00 

tcagaagcaa caaatacaaa tctaggtaca aaaccaacag atgtagttaa gccagagaca 3 60 

cctaaaatcc ctgaagtgcc gtcaacagta actccagatg tgattgacaa ggtagacaaa 420 

gaaacaggta atattacgat taaaccaatt caaatcagcc ctgataaaac gattattgga 4 80 

acacaaaatg gtaatgtcat tattcgagat agttcaggca gccacttagt atctgcaagt 540 

gaagttggtg gtcgacttaa tgaagatggg acagtaacaa ttaaagattc gtcagggaaa 600 

acaaaaacac tccctcgtac tggagaagct aaaggattct taagcattat tggtgccaca 660 

atcctatctt ttgtagctta tttattcaag cagagagtaa ctttaaatta a 711 

SeqID 471 

atgagaccta tcgttagaaa aaagatgtac aaaaaaggta aatttttggt cattgcaggt 60 

atcgttacta ttttaggtgg tagtgcaatt cttggtcaag atgtcaaggc agaacaggca 120 

gaagcagcaa cttcaacgat ctcagagaaa actgatagca gtcagactat ttcggatact 180 

tccaagttaa cgttacctgt taattcctcg gaggctatga aaaatagtgc agagccttta 240 

ataaaaacag gctttgctac atctgtttca tctaatccta gagagatagc agctacccca 3 00 

gtcaaaacct ttgatgctag ttcaaaagtt gttgtaaaag ctagtactgc tgagcactca 3 60 

gcaaatcaaa caaattctaa tgttaatcaa gtagctaatg atagtgaagt tattactcaa 42 0 

caaaattcaa caaagcaatt accaaaagtg acttactctg ctcatgttca agatattgga 4 80 

tggcaaaagt cggttgataa tgcaacagtt tcaggaacag taggacagga aaagcaggtt 54 0 

gaggcaataa aattgtctat taaagcccct gaaggaataa caggaaaact tagctataaa 6 00 

acttatgtta aaggacaagg ttggcaacct tcagtagaga gtggtcaagt tagtggtact 660 

gttggtcaga gtagacctat tgaagcacta agcattaacc ttactgataa tttacaaaaa 720 

ttatatgatg tctattatcg tgttcatgtt caagatatcg ggtggatggc ttgggctaaa 780 

aatggagctt acgctgggac tttgggaatg tcaaagcgcc ttgaagctta cgaagtaaag 84 0 

tttaccctta aaggacaatc tgttttaaca ccaactatac ttaaggaaga gagacctgtc 9 00 

ttaaattatc aggttaaggt aggccaaaat ggttggcaat ctaataaact tgaaggacaa 960 

atggcgggaa ctttaggaga atccaaagcc ctagatggag taaaattcac tcttagcacc 1020 

ttgaagtatg gggacattct ctatcgtact catgtacaag ataaaggttg gggagaaatt 1080 

ttaggtagtc aaacaagtaa ggattatcaa ggaaaaagac tagaagctat tcaattggac 114 0 

ttaagtggga atttaaaaca aaattatgat atttattatc gcgctcacgt ccaagataaa 12 0 0 

ggttggatgg cttggcaaaa aaataatagt gtagcaggta cagtcggaga atcgaaacgt 12 6 0 

ttagaggctt tggaagtaaa actagtagct aaagcttcag attttagttc ccaaacaaat 1320 

cattcattct tagaaatgaa gcatcctgga ttaagttatc aaacttatct tcaaaaagat 13 8 0 

ggttggaaac caacggtttt ggaaggccaa ttaggaggta gtattggact ttcaaaatct 144 0 

ataaaagcca tcaaacttaa cttaggtagt acagctctag gaaatataga atatagaact 15 0 0 

tttcttaatg gttcaggttg gcaaactgtt gttaattcag ggagagaaag taatgtacca 15 60 

aatgaatctc aacaagtaga agctattcag atgcgattga caggtctgct atcgaaaacc 162 0 

tatagtgtga tgtatcgagt tcacatgcaa gattatggtt ggcaggattg gacttataac 168 0 

aatgacatcg ctggtagtac taaccaagga aaacgaattg aagcaatcga aattaaatta 174 0 

gttgagactg ctaaaatacc tataactgtc caaacgacct ataagggaac tggaaactat 18 0 0 

aatgtacgag tgacaaatgt tattagtcct gggaatttaa aaattgctat ttggtcagat 18 60 

aaaaacaatc aggatgacct taagtggtat accgtcacac ccaaaaatca tgaagctaga 192 0 

ttgacattta atgttacaaa tcatagagac aatggtaaat actttatcca tgtttatcag 19 8 0 

gaaaatgctt ctaatcaaaa acaattattg gtaaaaactt ctctacaagt cgcacattca 204 0 

aattataata ctccttactt tagccaacgt gatggaaggt gggcaagtcg aagatatggt 210 0 

atggcaactt taggagaaac tggttgtgta ccaacttctc tagctatgat tttgtcttct 216 0 

ttgaaaggag aaactgtttt accaacgcaa attgcagatt acctttacca taagacagtt 222 0 

gaatttaatc gtggtgttca aggaacaact tctagaggaa tcttaaaagc agccaaagaa 228 0 

tggggtgtaa cagcaactgc attaggaact caagctaacc ttgtacaagc gttgaaggat 234 0 

ggttatcatg tcctagcggc agtacaaaat aatgtttttg ttcttcatgg aagtcatgaa 24 00 

atcgttctca aaggatataa taacggactg acacatgtct ctgaccctta cacaccgtcc 2460 

ttgtctggtt ggtatccaat ttcacaactt tggaaagaac agagctatta tagtgaggat 2520 

agaattgata ttggagaacc ttttgtcaag gtaacagatg cctaa 2565 

SeqID 4 72 

atggcaagaa tcccttatca gaaaacactt attttttgtg tagccgttat tattgctatt 6 0 

tttattttgg gattatctaa ggatttagca caatctaaag ggcaaaaggt tgcaaataat 12 0 

aatactgtaa aaacagcacg tgtggttgca aacggagata ttttgctgca cgatgtacta 18 0 

tatgccagtg cgcgccaacc cgacggcact tataatttta ctccttattt taaagaggta 24 0 

aaatcttgga ttgagagcgc tgatttagcg attggtgatt atgaagggac gattagttca 3 00 

gaatatcctt tagcaggcta tcctttattt aatgctccta atgagattgc aactactatg 3 60 

aaagaaactg gatatgatgt cgttgactta gcacataatc atattctgga ttcacaactg 42 0 
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gcaggtgcta tcaatactgt caagactttt aatcgtatgg gattggatac tattggggtt 4 80 

tatttaaagg atagaaataa agaagatatc ttaattaagc gtgtaaatgg tataaaaatt 540 

gctattttag gttattcgta tggctacaat ggtatggaag ctaatgtttc aaagagtgac 600 

tatgaaaaac atatgtctga tttagatact aaaaagatta agcaagatat taaaaaggct 660 

gaaaaagaag cagatatcac cattgtcatg ccacagatgg ggatagaata tcaaaaaaaa 720 

cctactacag aacaagtaat gctttatcac agtatgatta aatggggggc agatattata 780 

ttcggaggac accctcacgt tgttgaaccg tctgaagtga taaaaaaaga tggccagaaa 840 

aaatttatta tttactctat gggaaatttt atttcaaatc agcgtttaga aacagttgat 900 

gatatctgga cagaaagagg attattaatg gatgtaacaa tagagaaaaa aggtcaaaaa 960 

acagtaatta aaaaagttaa ggcgcatcca acactagtag aagcaaaagc aaatggccgt 1020 

tatgttgaag aaggctttcc agcttatgat tataggacgc tagttttaga ggattatatt 1080 

gaaggtggta agaaacgttc tacgattgat cagactatga aagataaagt tgataaagca 1140 

tataaggaga tgaatgagtt agttaatttg aagtggtaa 1179 

SeqID 473 

atgaaacatt tctttgatta caagacaatg tctgtcgcac gtaaactcag cattagtttt 60 

atcgctgtta tcctattggg aagtatttta ctttcacttc caatttttca atatgctaac 120 

gcccctaaaa cacattacat tgatcacctt ttcacgactg tttcaatggt atgtgtaact 180 

ggtttatctg tttttcctat cagtaaggtt tataacggtt gggggcagat tgtagctatt 240 

ttactcatgc aaacaggggg acttggttta gtaacactaa tgtcgttgtc ttactatacc 300 

ctaagacgta aaatgtcgtt aaatgatcag acccttttgc agtcagcgat aacttataac 3 60 

agttcgacag atttaaaaaa atatctctat atgattttta aagtaacttt aactttagaa 420 

gttttagcag ccagcatttt agccatagat tttattcctc gatfctggttt gggtcatggt 480 

atctttaata gtatattctt agctgtctca gccttttgta atgccggttt tgataatcta 540 

ggggcaacga gcttagctca atttaaacta aatcctttgg ttaacattat cgtttgtttc 600 

ttgattattt caggaggact tggatttgcc gtatggaaag atttaattga agctactata 660 

caaacatctc acaagggacc aaaattaatc aagacatttc caaaacgcct cagcaaccat 72 0 

tctaaactgg ttttaaaaac aacaaccatt attttactaa caggaacttt gctttcttgg 780 

ttgttagagt ttggaaattt tagaacaatt gctaatttat ctctacctaa acaactaatg 84 0 

gtcagctttt tccaaacagt tacgatgcgt accgctggat tttcaactat tgattatacc 900 

caaactgatt tcgcaaccaa ccttgtttat attatccaaa tgttaattgg aggagctcct 9 60 

ggtggaacgg caggaggatt taaagtcact gttatcgcca ttttactctt actatttaag 1020 

gccgaactga gtggccaaag ccaagtgact tt teat tat c gaacaattcc ttccagtatt 108 0 

attaaacaaa cactatcgat tttaactttc ttttttatca tacttattag cggctatctc 1140 

cttcttctgg aactcaatcc ccatatagat cctttttctc ttttctttga ggcatctagt 12 00 

gecttagcaa ccgtgggtgt gacaatgaac actaccaatc aattaagtct gggeggtega 12 60 

attgtaatta tgttcctcat gtttattggt egagteggtc caattaccgt actgetatea 1320 

attcttcaga aaaaagagaa ggaaattcac tatgetgaaa cagaaattat tctaggttag 13 8 0 

SeqID 474 

atgttagacg aattaaaagc aatcataaaa ageccaaaat tatggattac gatggctggg 60 

gttgcactga ttccaacgct atataatgtt atctttctaa gctcaatgtg ggatccttat 12 0 

gggaatacta aaaatttacc tgtggcagtc gttaaccaag ataaategge taaattaaat 180 

ggaaaaacta tttcaattgg caaagatatg gaggacaatt tgtctaaaaa tgatagctta 240 

gatttccatt ttacaacagc taaaagagca gaaaaagaat tagaaaaggg tcactattat 3 00 

atggttatta cctttcctaa agatctttct agaaaagega caactctcat gacagaaaaa 3 60 

ccagaaaggt taaatattac ttataaaact actaaggggc gtagttttgt agcttcaaaa 420 

atgagtgaga cagcagctaa taaactaaaa gatgaggtag cagaatctat tacagggact 4 80 

tatactgaat cagtttttaa aaatatgggt tctatgaaaa caggtatcaa taaagctget 540 

gatgggagtc aagaattatt aaatggttct aataagctac aggatgggag ccaaaccttg 6 00 

acaagtaatt tagatgtttt agcaagctct agtcaaactt tttctggagg agctaataaa 660 

ctaaatagtg gaataaacct ttatacagat ggtgtgggga ccctatctaa cgggttagaa 72 0 

acattatcag atggtgttac tgcttacaca acaggtgtgc ataagttgag tgaaggtagt 780 

caaaaattag atgataagtc tcaagcattg gtggaaggat cagaaaagtt gacggatggt 840 

ttacaacaat tgtcacaagc aactcaacta aaaccagagc aagaaegtae tttgeagaac 90 0 

ctttcagatg gattaaagaa tctaaatcag attattacaa atttacaatc gacagcaact 9 60 

actgatagcg atactaatag taagttattt aattttttat caactattga atcaagtaca 102 0 

aaggctttga tgaataegge tgetgetgat aaacaaaagc aaatgactgc ggtacaatca 108 0 

aegtcagett ttaaaagttt aacccctgaa caacaatctc aaatcacctc ageggttact 114 0 

ggaacaccta cttcagctga gaeaattgea gcaaatattt cttcaaatat agaaaatatg 12 0 0 

aaaactgtgc ttagtgaagc atctagttca gccccttcta ataatggatc acaaaatctt 12 60 

caaactttgt caggaacagc taacaattta gtattaaaag ctatctcaga tttagataaa 132 0 

attcaaaaat tgccaacggc tactaagcag ttatatcaag gtagtcaaac cttgacaaaa 13 8 0 

ggaatcaegg attatacgaa tgctgtaggg caattacgea aaggtgcagt aactttagat 144 0 

agtaaatcca atcaattgat ttcaggaaca caaaaagcaa gtcagggagc tcaaacttta 15 0 0 
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gatagtaaat cggatcaatt gagagacgga gcggggcaat tagctagtgg ttctgataga 156 0 

attgcagatg gttctaataa gcttgctggg ggaggacatc agttgataga tgggttgaca 162 0 

gagttgtcag gtggtgtttc tcaattatca agttctttgg ggaaagctgg ggaccagttg 168 0 

tccatggttt ctgtaaataa agataatgcc aatgcagtat ctagcccagt cactafccaag 174 0 

cafcgaagatt atgatagcgt tgatacgaat ggtgttggaa tggcaccata catgatttca 18 0 0 

gttgcgttga tggtggtagc tttatcagcc aatgttattt ttgcgaaagc tttatctggc 18 6 0 

aaagagccag ctaatcgttt tagttgggct aaaaataaat fcattaatcaa tggattcatt 192 0 

gcaactctag cagcaactat cttatttttt gcagttcaat tcataggtct taaaccagat 198 0 

taccctggaa aaacctactt tattatccta ttgacagcat ggactttgat ggcattagta 2 04 0 

actgctttag tgggatggga taataggtat ggttccttct tgtcgttatt aatattatta 210 0 

ttccagcttg gttcaagcgc aggaacttac ccaatagaat tgagtcctaa gttctttcaa 216 0 

acaattcaac catttttacc gatgacttac tctgtttcag gattaagaga gaccatctcg 222 0 

ttgacgggag acgttaacca tcaatggaga atgctagtaa tctttttagt atcatcgatg 228 0 

atacttgctc ttcttattta tcgtaaacaa gaagattaa 2319 

SeqID 475 

MLK3CSHFLQIFTLCLALLTISGC 

HQALLRGDAD IAATRYTGTD ITGTLGLKAVKDTKEAS KI VKTE FQKRYNQTWYPTYGF SDTYAFMVTKE FARQNKIT 

KISDLKKLSTTMKAGVDSSWlVnS^ 

ILRDDKKFFPPYEASIWVNNSIIKK^ 

SeqID 476 

LKPSFKKLLLLFSIITILSIACTPHAKASGRSWKSWFIEQYFWLKRDKSYYK^QDESSFQKYLNASREQSDKGYYLD 
PNSWGGLVQERLFDMQVYSWITONGKANQKT^ 

YAX PRLVNLYRHFHEKNANLTLMGDSAGGGLALGLAHALSHQSGQEAI PQPKNI ILLS PWLDVTMKHPEI PKYEDTD 
P I L SAWGLARVGE I WANGS1STNTNHT YVS PKNAPATKLAPITLFTGTRE I FFPDIRD YAAQLQAAHHPVNYI AQEGMN 
HVYPIYPIEEAKTAQYQMIDIINKTP 

SeqID 477 

MTKKHLKTLAIjALTTVSVVTYSQEVYGLEREESVKQEQTQSASEDDWFEEDNERKTNV 
SNNSSSKTESVVSDPKQVPKAKPEVTQEASNSSOT 

INKLSQTSHLVIiPSHAADGTQLTQVASFAFTPDKKTAIAEYTSRLGENGKPSRLDIDQKEIIDEGEIFNAYQLTKLT 

IPNGYKSIGQDAFVDNKNIAEWLPESLETISDYAFAHMSLKQVKLPDNLKVIGELAFFDNQIGGKLYLPRHLIKLA 

ERAFKSNRIQTVEFLGSKLKVIGEASFQDmLSNVMLPDGLEKIESEAFTGNPGDEHYlSINQV^ 

ENTYWPDKSLWRATPDI^YTKWLEEDFTYQKNSOT 

SKTLRKYDLEEVKLPSTIRKIGAFAFQSIOTLKSFEASEDL 

IVL PE S VQE I GRS AFRQNGALHLMFI GNKVKT IGEMAFLSNKLE S VTSTLS EQKQBKT I E VQAFSDNALS E WLPPNLQ 
TIREEAFKRIflHLKEVKGSSTLSQIAFNAFDQNDGDKRFGK^ 

KIIEGLDYSTLRQTTQTQFREMTTAGKALLSKSNLRQGEKQKFLQEAQFFLGRVDLDKATAKAEKALVTKKATKNGH 
LLERSINKAVLAYNNSAIKKAEW^ 

LIYALDMSDTIGEGQKDAYGNPILNVDEDNEGYHALAVATI^ 

QAI RNAAAEADRL I PKTPKGYLNTVPNYRKKQVEKNS KPVDYKTPI FDKALPNEKVDGDRVAKGHNINAETNNYVAV 
TPIRSEQQLHKSQSDVNLPQTSSKMSTFIYEILGYVSLCLLFLVTAGKKGKRARK 

SeqID 478 

MNKKI AGI GLAS I AVL S LAACGHRGAS KS GGKSD S LKVAMVTDTGGVDDKS FNQSGWEGMQAWGKKNGLKKGAGFDY 
FQ S ASE SD YATNLDTAVS S GYKLI FGI GFS LHDAIDKAADNNKD VNYVI VDDVI KGKDNVAS WFADNE S AYIiAGI A 
AAKT TKTKT VGFVGGME S E VI TRFE KGFEAGVKS VDKS I KI KVD YAGS FGDAAKGKT I AAAQ YAS GAD I VYQ VAGGT 
GAGVFS E AKSRNE S LKEADKVWVLGVDRDQAAEGKYTS IGDGKASNFVLAS S I KEVGKS VELI ATKTSKGKFPGGNVT 
TYGLKDGGVDIATTNLSDDAVKAIKEAKAKIISGDIKVPSK 

SeqID 4 79 

MRIQTTNLDKGGGGKSSHTYNEGDWLSRVMG]^^ 

I RGDERLTDE QLDIiASRNWKYLQLFMWFS EHYEELAEQFDI I L IDTHNDE S LVTANC I AVSD I VLGVTDAS TNGFRA 

WIALKKFVDDIKNEAIEVITKKSYVEAEAYLIGNKIEYYGNN^ 

QSVFEQREHMTEKQKQDHQKFYDNIEYVYQNILTVLEGETK 

SeqID 48 0 

VKKWFLKILVPLVSLVFLISLGITILAGVLGAVGGSQSNCTIEVVTSTSTDSSISSGDGSIDSFVKEHKEAYILSW 
KAGGFLP S AS I TQTMI ENGFNFTNPNGT S L WQAHNMGGVKT S S KS NF P VTLAT YGQD S VDL S GTKPGARVGD GTGGA 
YTWFSSYDAGIVGKAEFiV[AHQTLYTKAI]SnsriDGIATLSAIADGGWATDGTYKTKL 

GDKPYKATTASAGDAGLSDIFTSNSKDCSDSS SGGATDGTGTVPADAIAWGYRPDNL PE S LKQYI IDPKSLGMAYAN 
S S GWFNPGSD YLAGQCVNLTI S LGNHLWGHS GS VS GNGKDQAGAWANI FGNS VKTTPKRGAI FS TQNGGGGYGHTGI 
VCHVFENGSILIVEQNTPLSGADYFKKSYTW1JYRIWTPAQQKAEVISFAYPYGKEPKLGN 
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SeqID 4 81 

MGKTFWKDIYRSITTSKGRFSSILLLMMLGS 

SLKGSQVEFSYFVDTTPQQNSKSYRLYSNTKTISTFDLVKGRLPLNKSEIALSFQERKKYAIGDKINFKQDKNKLFS 

NTGPLTIVGFVNSTEIWSKTNLGSSQTGDGDLD 

KVSRKLNIHNKIRYTKTKKESLRK^^ 

LRHNTQIMESPQIIVYTSIRTTFPGGQGYNTFDSSTN^ 

GYSDRQVIFKFIIYGFIAGTLGTTLGIIGGHYLLPRIISDIISKDLTXPNTQYHLFLNYSLIiAFVFSLLSIVLPVFV 
ITRRELKEKAAFLLLPKPPAKGSKIALEYINWIWK^ 

SLKQTVNEHFGRIMPYDILLTYNTNASPPKILELLSKDSKIDKYQPIHLENLDESIPGQINKQSISLFITDKKQLLP 

FIYLQEATTNKSLHH^KGIIISK^^ 

K^TKDNAFLVKLTKHKIATO 

LAERKRELSTIKr^GFYNEEVTLYIYRETIILSTIGVILGTISGTYLHRQMMLLIGSDQILFGEK^SPTTFIIPIS^ 
WI ILISLCFIVNHQLKKLNMLDALKSVD 

SeqID 4 82 

rWKRKLLTVLSIASVAVLSSVTFADEITPSDPTAPSTPLITPTPVDPGTPTDTTKPSTPVDPGTPTO 
GTPTDTTKPSTPVDSGTSTNPSDSEATNTNLGTKPTDVVKPETPKIPEVPSTVTPDVIDKTVDKETGNITIKPIQISP 
DKTI IGTQNGNVTIRDS SGSHLVSASEVGGRLNEDGTVTIBCDS SGKTKTLPRTGEAKGFLS I IGATILS FVAYLFKQ 
RVTLN 

SeqID 483 

MRPIVRKiayftTKKGKFLVIAGIOT 

EPLIKTGFATSVSSNPREIAATPVKTFDASSKVVVICASTA^ 

HVQD I GWQKSVDNATVSGTVGQEKQVEAIKIiS IKAPEGI TGKLS YKT YVKGQGWQPSVE S GQVSGT VGQS RPI EAL S 
INLTDNLQKLYDVYYRVHVQDI GWMAWAKNGAYAGTLGMS KRLEAYEVKFTLKGQS VLTPT I LKEERPVLNYQVKVG 
QNGWQSNKLEGQMAGTLGESKALDGVKFTLSTL^^ 
KQNYDIYYRAHVQDKGWMAWQKNNSVAGT^ 

KPTVLEGQLGGS I GLS KS IKAI KLNLGSTALGNI E YRTFLNGS GWQT WNS GRE SNVPNE SQQVEAI QMRLTGLLS K 
TYS Vl>TYRVHMQDYGWQDWTYlSnSIDIAGSTNQGK^IEAIEIKIjVETAKI PITVQTTYKGTGNYNVRVT1TVI S PGNLKIA 
IWSDKJSTNQDDLKWYTVTPKNHEARLTFIWTNHRDNGK^ 

WASRRYGMATLGETGCVPT S LAMI LS S LKGETVLPTQI ADYLYHKTVEFNRGVQGTTSRGILKAAKEWGVTATAIjGT 

QAISTLVQALKDGYHVLAAVQWNVFVLHGSHEIVLKGYNNGL 

FVKVTDA 

SeqID 484 

MARIPYQKTLIFCVAVIIAIFILGLSKD 

KEVKSWIESADLAIGDYEGTISSEYPLAGYPLFNAPNEIATTMK^ 
LDTIGVYLKDRNKEDILIKRWGIK^^ 

QMGIEYQKKPTTEQVMLYHSMIKWGADIIFGGHPHWEPSEVIKKDGQKKFIIYSMGNFISNQRLETVDDIWTERGIi 

LI^VTIEKKGQKTVIKKVKAHPTLVEAKANGRYVEEGFPAYDYRTLVL 

ELVNLKW 

SeqID 485 

MKHFFDYKTMSVARKLSISFIAVILLGSILLSLPIFQYAN^ 
VAILLMQTGGLGLVTLMSLSYYTLRRKMSL^ 

FGLGHGIFNSIFIAVSAFCNAGFDNLGATSriAQFKLNPLWIIVCFLIISGGLGFAVWKDLIEATIQTSHKGPKLIK 
TFPKRLS^SKLVLKTTTIILLTGTL^ 

VYIIQMLIGGAPGGTAGGFKVTVIAILLLLFKAELSGQSQVTFHYRTIPSSIIKQTLSILTFFFIILISGYLLLLEL 
NPHIDPFSLFFEASSAIiATVGVTMQSTT^ 

SeqID 486 

MLDELKAIIKSPKLWITMAGVALIPTL^ 

DSLDFHFTTAKKAEKELEKGHYYMVITFPro^ 

ESITGTYTESVFKmGSMKTGINKAADGSQELLN^ 

VGTLSNGLETLSDGVTAYTTGVHKLSEGSQKLDDKSQAIjVEGSEKLTDGLQQLSQATQLKPEQERTLQNLSDGLKl^ 
NQIITNLQSTATTDSDTNSKLFNFLSTIESSTKALMNTAAADKQKQMTAVQSTSAFKSLTPEQQSQITSAVTGTPTS 
AETIAANISSNIENMKTVLSEASSSAPS^^ 

TDYTNAVGQLRKGAVTLDSKSNQLISGTQKASQGAQTLDSKSDQLRDGAGQLASGSDRIADGSNKLAGGGHQLIDGL 
TEL S GGVSQLS S S LGKAGDQL SMVS VNKDNANAVS S PVTI KHEDYD S VDTNGVGMAPYMI S VALMWALS ANVI FAK 
AL S GKE PANRF S WAKNKLLINGF IATLAATILFFAVQFI GLKPD YPGKT YFI I LLTAWTLMALVTALVGWDNRYGS F 
LSLLILLFQLGSSAGTYPIELSPKFFQTIQPFLPMTYSVSGLR 



